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(54) INFORMATION PROCESSING APPARATUS AND METHOD, PROGRAM, AND RECORDED 
MEDIUM 



(57) In case continuous reproduction is commanded 
from a first AV stream to a second AV stream, a third AV 
stream, made up of a preset portion of the first AV 
stream and a preset portion of the second AV stream, 
is generated. The third AV stream is reproduced when 
reproduction is switched from the first AV stream to the 
second AV stream. As the information pertinent to the 



third AV stream, the address information of a source 
packet of the first AV stream at a timing of switching from 
the first AV stream to the third AV stream and the ad- 
dress information of a source packet of the second AV 
stream at a timing of switching from the third AV stream 
to the second AV stream, are generated. This enables 
reproduction such as to maintain continuity between 
separately recorded AV streams. 
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EP1 198 133 A1 

Description 

Technical Field 

5 [0001 ] This invention relates to an information processing method and apparatus, a program, and a recording medium 
and, more particularly, to an information processing method and apparatus, a program and a recording medium, con- 
figured for maintaining continuity of moving pictures in a reproducing domain. 

Background Art 

w 

[0002] Recently, a variety of types of optical discs have been proposed as a recording medium that can be removed 
from a recording apparatus. These recordable optical discs have been proposed as a large capacity medium of several 
GBs and are thought to be promising as a medium for recording AV (audio visual) signals, such as video signals. 
Among the digital AV signal sources (supply sources), to be recorded on this recordable optical disc, there are CS 
is digital satellite broadcast and BS digital broadcast. Additionally, the ground wave television broadcast of the digital 
system has also been proposed for future use. 

[0003] The digital video signals, supplied from these sources, are routinely compressed under the MPEG (Moving 
Picture Experts Group) 2 system. In a recording apparatus, a recording rate proper to the apparatus is set. If digital 
video signals of the digital broadcast are recorded in the conventional image storage mediums for domestic use, digital 
20 video signals are first decoded and subsequently bandwidth-limited for recording. In the case of the digital recording 
system, including, of course, the MPEG1 Video, MPEG2 video and The present invention provides an DV systems, 
digital video signals are first decoded and subsequently re-encoded in accordance with an encoding system for the 
recording rate proper to the apparatus for recording subsequently. 

[0004] However, this recording system, in which the supplied bitstream is decoded once and subsequently band- 
25 width-limited and re-encoded prior to recording, suffers from deteriorated picture quality. If, in recording compressed 
video digital signals, the transmission rate of input digital signals is less than the recording rate for the recording and/ 
or reproducing apparatus, the method of directly recording the supplied bitstream without decoding or re-encoding 
suffers from deterioration in the picture quality only to the least extent. However, if the transmission rate of the input 
digital signals exceeds the recording rate of the recording and/or reproducing apparatus, it is indeed necessary to re- 
30 encode the bitstream and to record the so-re-encoded bitstream, so that, after decoding in the recording and/or repro- 
ducing apparatus, the transmission rate will be not higher than the upper limit of the disc recording rate. 
[0005] If the bitstream is transmitted in a variable rate system in which the bit rate of the input digital signal is increased 
or decreased with time, the capacity of the recording medium can be exploited less wastefully in the case of a disc 
recording apparatus adapted for transiently storing data in a buffer and for recording the data in a burst fashion than 
35 in the case of a tape recording system having a fixed recording rate imposed by the fixed rpm of the rotary head. 

[0006] Thus, it may be predicted that, in the near future when the digital broadcast is to become the mainstream, an 
increasing demand will be raised for a recording and/or reproducing apparatus in which broadcast signals are recorded 
as digital signals, without decoding or re-encoding, as in a data streamer, and in which a disc is used as a recording 
medium. 

40 [0007] In reproducing data recorded on a recording medium in the above-described recording apparatus, there is 
known so-called skip reproduction in which reproduction is continued up to a preset picture and a picture temporally 
spaced apart from the preset picture is reproduced next. This skip reproduction has a drawback that temporal continuity 
is lost in the reproduced pictures. 

45 Disclosure of the Invention 

[0008] It is therefore an object of the present invention to enable reproduction such as to maintain continuity of moving 
pictures in the playback domain. 

[0009] The present invention provides an information processing apparatus including generating means for gener- 
50 ating, in case continuous reproduction from a first AV stream to a second AV stream is commanded, a third AV stream 
made up of a preset portion of the first AV stream and a preset portion of the second AV stream, the third AV stream 
being reproduced when reproduction is switched from the first AV stream to the second AV stream, and the address 
information, as the information pertinent to the third AV stream, the address information being made up of the information 
on an address of a source packet of the first AV stream at a timing of switching of reproduction from the first AV stream 
55 to the third AV stream, and of the information on the address of a source packet of the second AV stream at a timing 
of switching of reproduction from the third AV stream to the second AV stream, and recording means for recording the 
third AV stream and the address information, as generated by the generating means. 

[001 0] Preferably, an arrival time stamp of the source packet of the first AV stream included in the address information 
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generated by the generating means, and an arrival time stamp of a source packet located at a leading end of the third 
AV stream, are continuous to each other, and an arrival time stamp of the source packet of the second AV stream 
included in the address information generated by the generating means and an arrival time stamp of a source packet 
located at a trailing end of the third AV stream are continuous to each other. 
5 [0011] Preferably, a sole discontinuous point exists in the arrival time stamp of the source packet in the third AV 
stream. 

[0012] Preferably, the address is determined so that a data portion of the AV stream previous to a source packet 
specified by the information on the address of the source packet of the first AV stream contained in the address infor- 
mation generated by the generating means will be located in a continuous area of not less than a preset size on a 
10 recording medium. 

[0013] Preferably, the address is determined so that a data portion of the AV stream subsequent to a source packet 
specified by the information on the address of the source packet of the second AV stream contained in the address 
information generated by the generating means will be located in a continuous area of not less than a preset size on 
a recording medium. 

15 [0014] Preferably, the third AV stream is generated so that the third AV stream will be located in a continuous area 
of not less than a preset size on the recording medium. 

[0015] The present invention also provides an information generating method including a step of generating, in case 
continuous reproduction from a first AV stream to a second AV stream is commanded, a third AV stream made up of 
a preset portion of the first AV stream and a preset portion of the second AV stream, the third AV stream being repro- 

20 duced when reproduction is switched from the first AV stream to the second AV stream, and a step of generating the 
address information, as the information pertinent to the third AV stream, the address information being made up of the 
information on an address of a source packet of the first AV stream at a timing of switching of reproduction from the 
first AV stream to the third AV stream, and of the information on the address of a source packet of the second AV 
stream at a timing of switching of reproduction from the third AV stream to the second AV stream. 

25 [0016] The present invention also provides a recording medium having recorded thereon a computer-readable pro- 
gram, in which the program includes a step of generating, in case continuous reproduction from a first AV stream to a 
second AV stream is commanded, a third AV stream made up of a preset portion of the first AV stream and a preset 
portion of the second AV stream, the third AV stream being reproduced when reproduction is switched from the first 
AV stream to the second AV stream, and a step of generating the address information, as the information pertinent to 

30 the third AV stream, the address information being made up of the information on an address of a source packet of 
the first AV stream at a timing of switching of reproduction from the first AV stream to the third AV stream, and of the 
information on the address of a source packet of the second AV stream at a timing of switching of reproduction from 
the third AV stream to the second AV stream. 

[0017] The present invention also provides a program for having a computer execute a program, in which the program 
35 includes a step of generating, in case continuous reproduction from a first AV stream to a second AV stream is com- 
manded, a third AV stream made up of a preset portion of the first AV stream and a preset portion of the second AV 
stream, the third AV stream being reproduced when reproduction is switched from the first AV stream to the second 
AV stream, and a step of generating the address information, as the information pertinent to the third AV stream, the 
address information being made up of the information on an address of a source packet of the first AV stream at a 
40 timing of switching of reproduction from the first AV stream to the third AV stream, and of the information on the address 
of a source packet of the second AV stream at a timing of switching of reproduction from the third AV stream to the 
second AV stream. 

[0018] The present invention also provides an information processing apparatus including first readout means for 
reading out a first AV stream, a second AV stream or a third AV stream from a recording medium, second readout 

45 means for reading out, as the information pertinent to the third AV stream, the information on an address of a source 
packet of the first AV stream at a timing of switching of reproduction from the first AV stream to the third AV stream, 
and of the information on the address of a source packet of the second AV stream at a timing of switching of reproduction 
from the third AV stream to the second AV stream, and reproducing means for performing reproduction as reproduction 
is switched from the first AV stream read out by the first readout means to the third AV stream and from the third AV 

so stream to the second AV stream, based on the information pertinent to the third AV stream read out by the second 
readout means. 

[0019] The present invention also provides an information processing method including a first readout controlling 
step of reading out a first AV stream, a second AV stream or a third AV stream from a recording medium, a second 
readout controlling step of reading out, as the information pertinent to the third AV stream, the information on an address 
55 of a source packet of the first AV stream at a timing of switching of reproduction from the first AV stream to the third 
AV stream, and of the information on the address of a source packet of the second AV stream at a timing of switching 
of reproduction from the third AV stream to the second AV stream, and a reproducing step of performing reproduction 
as reproduction is switched from the first AV stream read out by the first readout means to the third AV stream and 
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from the third AV stream to the second AV stream, based on the information pertinent to the third AV stream read out 
by the second readout means. 

[0020] The present invention also provides a recording medium having recorded thereon a computer-readable pro- 
gram, in which the program includes a first readout controlling step of reading out a first AV stream, a second AV stream 

5 or a third AV stream from a recording medium, a second readout controlling step of reading out, as the information 
pertinent to the third AV stream, the information on an address of a source packet of the first AV stream at a timing of 
switching of reproduction from the first AV stream to the third AV stream, and of the information on the address of a 
source packet of the second AV stream at a timing of switching of reproduction from the third AV stream to the second 
AV stream, and a reproducing step of performing reproduction as reproduction is switched from the first AV stream 

io read out by the first readout means to the third AV stream and from the third AV stream to the second AV stream, based 
on the information pertinent to the third AV stream read out by the second readout means. 

[0021 ] The present invention also provides a program for having a computer execute a program, in which the program 
includes a first readout controlling step of reading out a first AV stream, a second AV stream or a third AV stream from 
a recording medium, a second readout controlling step of reading out, as the information pertinent to the third AV 

'5 stream, the information on an address of a source packet of the first AV stream at a timing of switching of reproduction 
from the first AV stream to the third AV stream, and of the information on the address of a source packet of the second 
AV stream at a timing of switching of reproduction from the third AV stream to the second AV stream, and a reproducing 
step of performing reproduction as reproduction is switched from the first AV stream read out by the first readout means 
to the third AV stream and from the third AV stream to the second AV stream, based on the information pertinent to 

20 the third AV stream read out by the second readout means. 

[0022] The present invention also provides a recording medium having recorded thereon the address information 
including, in case continuous reproduction from a first AV stream to a second AV stream is commanded, a third AV 
stream made up of a preset portion of the first AV stream and a preset portion of the second AV stream, the third AV 
stream being reproduced when reproduction is switched from the first AV stream to the second AV stream, and the 

25 address information, as the information pertinent to the third AV stream, the address information being made up of the 
information on an address of a source packet of the first AV stream at a timing of switching of reproduction from the 
first AV stream to the third AV stream, and of the information on the address of a source packet of the second AV 
stream at a timing of switching of reproduction from the third AV stream to the second AV stream. 
[0023] In the information processing method and apparatus, and the program, according to the present invention, if 

30 the it is commanded to perform reproduction continuously from the first AV stream to the second AV stream, there is 
generated a third AV stream made up of a preset portion of the first AV stream and a preset portion of the second AV 
stream and which is reproduced when reproduction is switched from the first AV stream to the second AV stream, while 
there is generated, as the information pertinent to the third AV stream, the address information made up of the infor- 
mation on an address of a source packet of the first AV stream at a timing of switching of reproduction from the first 

35 AV stream to the third AV stream, and of the information on the address of a source packet of the second AV stream 
at a timing of switching of reproduction from the third AV stream to the second AV stream. 

[0024] In the information processing method and apparatus, and the program, according to the present invention, a 
first AV stream, a second AV stream or a third AV stream is read out from a recording medium, the address information 
made up of the information on an address of a source packet of the first AV stream at a timing of switching of reproduction 

40 from the first AV stream to the third AV stream, and the information on the address of a source packet of the second 
AV stream at a timing of switching of reproduction from the third AV stream to the second AV stream, is read out from 
the recording medium as the information pertinent to the third AV stream, and reproduction is switched from the first 
AV stream to the third AV stream and from the third AV stream to the second AV stream, as reproduction proceeds, 
based on the read-out information pertinent to the third AV stream. 

45 [0025] Other objects, features and advantages of the present invention will become more apparent from reading the 
embodiments of the present invention as shown in the drawings. 

Brief Description of the Drawings 

so [0026] 

Fig.1 shows a configuration of an embodiment of a recording and/or reproducing apparatus according to the present 
invention. 

Fig.2 illustrates the data format of data recorded on a recording medium by a recording and/or reproducing appa- 
55 ratus 1 . 

Fig.3 illustrates Real PlayList and Virtual Play List. 

Figs.4A, 4B and 4C illustrate the creation of the Real PlayList. 

Figs.5A, 5B and 5C illustrate deletion of the Real PlayList. 
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Figs.6A and 6B illustrate assemble editing. 
Fig.7 illustrates provision of a sub path in the Virtual PlayList. 
Fig. 8 illustrates the changing of the playback sequence of the PlayList. 
Fig.9 illustrates a mark on the PlayList and a mark on the Clip. 
5 Fig.1 0 illustrates a menu thumbnail. 

Fig. 11 illustrates mark added to the PlayList. 
Fig. 12 illustrates a mark added to the Clip. 

Fig. 13 illustrates the relation between the PlayList, Clip and the thumbnail file. 

Fig.1 4 illustrates a directory structure. 
w Fig.1 5 illustrates a syntax of infr.dvr. 

Fig.1 6 shows a syntax of DVRVolume. 

Fig.1 7 shows a syntax of ResumeVolume. 

Fig.1 8 shows a syntax of UlApplnfoVolume. 

Fig.1 9 shows a table of character set values. 
*5 Fig.20 shows a syntax of TableOf PlayList. 

Fig,21 shows another syntax of TableOfPlayList. 

Fig.22 shows a syntax of the Makers PrivateData. 

Fig.23 shows a syntax of xxxx.rpls and yyyy.vpls. 

Figs.24A to 24C illustrate the PlayList. 
20 Fig.25 shows a syntax of PlayList. 

Fig.26 shows a table of PlayList__type. 

Fig.27 shows a syntax of UlApplnfoPlayList. 

Figs.28A to 28C illustrate flags in the UlApplnfoPlayList syntax shown in Fig.27. 

Fig. 29 illustrates a Playltem. 
25 Fig.30 illustrates a Playltem. 

Fig.31 illustrates a Playltem. 

Fig.32 shows a syntax of the Playltem. 

Fig.33 illustrates IN-time. 

Fig. 34 illustrates OUT-time. 
30 Fig.35 shows a table of Connection_Condition. 

Figs.36A to 36D illustrate Connection_Condition. 

Fig.37 illustrates BridgeSequencelnfo. 

Fig.38 shows a syntax of BridgeSequencelnfo. 

Fig,39 illustrates SubPlayltem. 
35 Fig.40 shows a syntax of SubPlayltem. 

Fig. 41 shows a table of Mark_type. 

Fig.42 shows a syntax of PlayListMark. 

Fig,43 shows a table of Mark_type. 

Fig.44 illustrates Mark_time_stamp. 
40 Fig.45 shows a syntax of zzzzz.clip. 

Fig.46 shows a syntax of Cliplnfo. 

Fig.47 shows a table of Clip_stream_type. 

Fig,48 illustrates offset_SPN. 

Fig.49 illustrates offset_SPN. 
45 Figs.50A, 50B illustrate the STC domain. 

Fig.51 illustrates STCJnfo. 

Fig. 52 shows a syntax of STCJnfo. 

Fig.53 illustrates Programlnfo. 

Fig.54 shows a syntax of Programlnfo. 
50 Fig. 55 shows a syntax of VideoCondinglnfo. 

Fig.56 shows a table of Videojormat. 

Fig.57 shows a table of frame_rate. 

Fig.58 shows a table of display_aspect_ratio. 

Fig.59 shows a syntax of AudioCondinglnfo. 
55 Fig.60 shows a table of audio_coding. 

Ftg.61 shows a table of audio_component_type. 

Fig.62 shows a table of samplingjrequency. 

Fig.63 illustrates CPI. 
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Fig.64 illustrates CPI. 

Fig.65 shows a syntax of CPI. 

Fig. 66 shows a table of CPIJype. 

Fig.67 illustrates video EPjmap. 
5 Fig.68 illustrates EPjnap. 

Fig.69 illustrates EP_map. 

Fig. 70 shows a syntax of EP_map. 

Fig.71 shows a table of EPJypevalues. 

Fig. 72 shows a syntax of EP_map_for_one_stream_PID. 
io Fig.73 illustrates TU_map. 

Fig. 74 shows a syntax of TU_map. 

Fig.75 shows a syntax of ClipMark. 

Fig. 76 shows a table of Markjype. 

Fig.77 shows a table of Mark_type_stamp. 
15 Fig.78 shows a syntax of menu.thmb and mark.thmb. 

Fig. 79 shows the syntax of thumbnail. 

Fig.80 shows a table of thumbnail _picture_format. 

Figs.81 A and 81 Billustrate tn_block. 

Fig.82 illustrates a structure of a transport stream of DVR MPEG2. 
20 Fig.83 shows a recorder model of a transport stream of DVR MPEG2. 

Fig.84 shows a player model of a transport stream of DVR MPEG2. 

Fig. 85 shows the syntax of a source packet. 

Fig. 86 shows the syntax of TP_extra_header. 

Fig. 87 shows a table of a copy permission indicator. 
25 Fig.88 illustrates seamless connection. 

Fig. 89 illustrates seamless connection. 

Fig. 90 illustrates seamless connection. 

Fig. 91 illustrates seamless connection. 

Fig. 92 illustrates seamless connection. 
30 Fig.93 illustrates audio overlap. 

Fig.94 illustrates seamless connection employing BridgeSequence. 

Fig.95 illustrates seamless connection not employing BridgeSequence. 

Fig.96 shows a DVR STD model. 

Fig.97 is a timing chart for decoding and display. 
35 Fig.98 shows another syntax of BridgeSequencelnfo. 

Fig.99 illustrates Bridge_Clip when two Playltems are seamlessly connected. 

Fig. 100 shows a syntax of a Clip Information file. 

Fig. 101 shows a syntax of a Cliplnfo of the Clip Information file of Fig. 100. 

Fig. 102 shows a syntax of a Sequencelnfo of the Clip information file of Fig. 100. 
40 Figs.1 03A and 1 03B illustrate database change in case stream data of the Clip AV stream file is partially erased. 

Fig. 104 is a flowchart for illustrating the preparation of a RealPlayList. 

Fig.1 05 is a flowchart for illustrating the formulation of a Virtual PlayList. 

Fig.1 06 is a flowchart for illustrating the formulation of a bridge sequence. 

Fig. 107 is a flowchart for illustrating the reproduction of PlayList. 
45 Fig, 1 08 illustrates a medium. 

Best Mode for Carrying out the Invention 

[0027] Referring to the drawings, present embodiment of the present invention will be explained in detail. Fig.1 shows 
so a typical inner structure of a recording and/or reproducing apparatus 1 embodying the present invention. First, the 
structure of a recording unit 2, configured for recording signals input from outside, is explained. The recording and/or 
reproducing apparatus 1 is configured for being fed with and recording analog or digital data. 
[0028] Analog video signals and analog audio signals are fed to terminals 11 , 12, respectively. The video signals, 
input to the terminal 11, are output to an analysis unit 14 and to an AV encoder 15. The audio signals, input to the 
55 terminal 12, are output to the analysis unit 14 and to the AV encoder 15. The analysis unit 14 extracts feature points, 
such as scene changes, from the input video and audio signals. 

[0029] The AV encoder 15 encodes input video and audio signal to output the system information (S), such as an 
encoded video stream (V), an encoded audio stream (A) and AV synchronization, to a multiplexer 16. 
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[0030] The encoded video stream is a video stream encoded e.g., with the MPEG (Moving Picture Expert Group) 2 
system, whilst the encoded audio stream is an audio stream encoded in accordance with the MPEG1 system, with the 
encoded audio stream being e.g., an audio stream encoded in e.g., the MPEG1 system or an audio stream encoded 
in accordance with the Dolby AC3 (trademak) system. The multiplexer 1 6 multiplexes the input video and audio streams, 
5 based on the input system information, to output a multiplexed stream through a switch 17 to a multiplexed stream 
analysis unit 18 and to a source packetizer 19. 

[0031] The multiplexed stream is e.g., an MPEG-2 transport stream or an MPEG2 program stream. The source 
packetizer 19 encodes the input multiplexed stream into an AV stream composed of source packets in accordance 
with an application format of a recording medium 100 on which to record the stream. The AV stream is processed in 
10 ECC (error correction and coding) unit 20 and a modulation unit 21 with appendage of ECC codes and with modulation, 
before being output to a write unit 22, which then writes (records) an AV stream file based on a control signals output 
by the controller 23. 

[0032] The transport stream, such as digital television broadcast, input from a digital interface or a digital television 
tuner, is input to a terminal 1 3. There are two recording systems for recording the transport stream input to the terminal 
*s 13, one being a transparent recording system and the other being a system in which recording is preceded by re- 
encoding aimed to lower e.g. , the recording bit rate. The recording system command information is input from a terminal 
24 as a user interface to a controller 23. 

[0033] In the transparent recording of the input transport stream, a transport stream, input to a terminal 1 3, is output 
through a switch 1 7 to a multiplexed stream analysis unit 1 8 and to the source packetizer 1 9. The ensuing processing 
20 of recording an AV stream on a recording medium is the same as that of encoding and recording analog input audio 
and video signals, as described above, and hence is not explained here for simplicity. 

[0034] If the input transport stream is re-encoded and subsequently recorded, the transport stream, input to the 
terminal 1 3, is fed to a demultiplexer 26, which demultiplexes the input transport stream to extract a video stream (V), 
an audio stream (A) and the system information (S). 

25 [0035] Of the stream (information), as extracted by the demultiplexer 26, the video stream is output to an audio 
decoder 27, whilst the audio stream and the system information are output to the multiplexer 16. The audio decoder 
27 decodes the input transport stream to output the encoded video stream (V) to the multiplexer 1 6. 
[0036] The audio stream and the system information, output from the demultiplexer 26 and input to the multiplexer 
16, and the video stream, output by the AV encoder 15, are multiplexed, based on the input system information, and 

30 output to the multiplexed stream analysis unit 18 and to the source packetizer 19 through switch 17, as a multiplexed 
stream. The ensuing processing of recording an AV stream on a recording medium is the same as that of encoding 
and recording analog input audio and video signals, as described above, and hence is not explained here for simplicity. 
[0037] The recording and/or reproducing apparatus 1 of the present embodiment records a file of the AV stream on 
the recording medium 1 00, while also recording the application database information which accounts for the file. The 

35 input information to the controller 23 is the feature information for the moving picture from the analysis unit 14, the 
feature information of the AV stream from the multiplexed stream analysis unit 18 and the user command information 
input at a terminal 24. 

[0038] The feature information of the moving picture, supplied from the analysis unit 1 4, is generated by the analysis 
unit 14 when the AV encoder 15 encodes video signals. The analysis unit 14 analyzes the contents of the input video 
40 and audio signals to generate the information pertinent to the pictures characteristic of the input moving picture signals 
(clip mark). This information is the information indicating a picture of characteristic clip mark points, such as program 
start points, scene change points, CM commercial start and end points, title or telop in input video signals, and also 
includes a thumbnail of the picture and the information pertinent to stereo/monaural switching points and muted portions 
of audio signals. 

45 [0039] The above picture indicating information is fed through controller 23 to the multiplexer 1 6. When multiplexing 
a encoded picture specified as clip mark by the controller 23, the multiplexer 1 6 returns the information for specifying 
the encoded picture on the AV stream to the controller 23. Specifically, this information is the PTS (presentation time 
stamp) of a picture or the address information on the AV stream of an encoded version of the picture. The controller 
23 stores the sort of feature pictures and the information for specifying the encoded picture on the AV stream in asso- 
so ciation with each other. 

[0040] The feature information of the AV stream from the multiplexed stream analysis unit 18 is the information 
pertinent to the encoding information of the AV stream to be recorded, and is recorded by an analysis unit 18. For 
example, the feature information includes the time stamp and address information of the l-picture in the AV stream, 
discontinuous point information of system time clocks, encoding parameters of the AV stream and change point infor- 
ms mation of the encoding parameters in the AV stream. When transparently recording the transport stream, input from 
the terminal 13, the multiplexed stream analysis unit 18 detects the picture of the aforementioned clip mark, from the 
input transport stream, and generates the information for specifying a picture designated by the clip mark and its type. 
[0041] The user designation information from the terminal 24 is the information specifying the playback domain, 
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designated by the user, character letters for explaining the contents of the playback domain, or the information such 
as bookmarks or resuming points set by the user for his or her favorite scene. 

[0042] Based on the aforementioned input information, the controller 23 creates a database of the AV stream (Clip), 
a database of a group (PlayList) of playback domains (Playltem) of the AV stream, management information of the 
5 recorded contents of the recording medium 100 (info.dvr) and the information on thumbnail pictures. Similarly to the 
AV stream, the application database information, constructed from the above information, is processed in the ECC unit 
20 and the modulation unit 21 and input to the write unit 22, which then records a database file on the recording medium 
100. 

[0043] The above-described application database information will be explained subsequently in detail. 

10 [0044] When the AV stream file recorded on the recording medium 100 (files of picture data and speech data) and 
the application database information, thus recorded on the recording medium 100, are reproduced by a reproducing 
unit 3, the controller 23 first commands a readout unit 28 to read out the application database information from the 
recording medium 1 00. The readout unit 28 reads out the application database information from the recording medium 
1 00, which then reads out the application database information from the recording medium 1 00 to send the application 

15 database information through demodulation and error correction processing by a demodulating unit 29 and an ECC 
decoder 30 to the controller 23. 

[0045] Based on the application database information, the controller 23 outputs a list of PlayList recorded on the 
recording medium 1 00 to a user interface of the terminal 24. The user selects the PlayList, desired to be reproduced, 
from the list of PlayLists. The information pertinent to PlayList, specified to be reproduced, is input to the controller 23. 
20 The controller 23 commands the readout unit 28 to read out the AV stream file necessary in reproducing the PlayList. 
In accordance with the command, the readout unit 28 reads out the corresponding AV stream from the recording 
medium 100 to output the read-out AV stream to the demodulating unit 29. The AV stream, thus input to the demodu- 
lating unit 29, is demodulated by preset processing and output through the processing by the ECC decoder 30 to a 
source depacketizer 31 . 

25 [0046] The source depacketizer 31 converts the AV stream of the application format, read out from the recording 
medium 1 00 and processed in a preset fashion, into a stream processable by the demultiplexer 26. The demultiplexer 
26 outputs the system information (S), such as the video stream (V), audio stream (A) or the AV synchronization, 
forming the playback domain (Playltem) of the AV stream specified by the controller 23, to the audio decoder 27, which 
AV decoder 27 decodes the video stream and the audio stream to output the playback video signal and the playback 

30 audio signal to associated terminals 32, 33, respectively, 

[0047] If fed from the terminal 24, as a user interface, with the information instructing random access playback or 
special playback, the controller 23 determines the readout position of the AV stream from the recording medium 100, 
based on the contents of the database (Clip) of the AV stream, to command the readout unit 28 to read out the AV 
stream. If the PlayList as selected by the user is to be reproduced as from a preset time point, the controller 23 com- 

35 mands the readout unit 28 to read out data from an l-picture having a time stamp closest to the specified time point. 
[0048] When the user has selected a certain clip mark from indexing points or scene change points for the program 
stored in the ClipMark in the Clip Information, as when the user selects a certain picture from a list of thumbnail pictures, 
as demonstrated on a user interface, of the indexing points or scene change points stored in the ClipMark, the controller 
23 determines the AV stream readout position from the recording medium 1 00 to command the readout unit 28 to read 

40 out the AV stream. That is, the controller 23 commands the readout unit 28 to read out data from an l-picture having 
an address closest to the address on the AV stream which has stored the picture selected by the user. The readout 
unit 28 reads out data from the specified address . The read-out data is processed by the demodulating unit 29, ECC 
decoder 30 and by the source packetizer 19 so as to be supplied to the demultiplexer 26 and decoded by the audio 
decoder 27 to reproduce AV data indicated by an address of the mark point picture. 

45 [0049] If the user has commanded fast forward playback, the controller 23 commands the readout unit 28 to sequen- 
tially read out l-picture data in the AV stream in succession based on the database (Clip) of the AV stream. 
[0050] The readout unit 28 reads out data of the AV stream from a specified random access point. The so read-out 
data is reproduced through processing by various components on the downstream side. 

[0051] The case in which the user edits the AV stream recorded on the recording medium 1 00 is now explained. If 
50 desired to specify a playback domain for the AV stream recorded on the recording medium 100, for example, if desired 
to create a playback route of reproducing a portion sung by a singer A from a song program A, and subsequently 
reproducing a portion sung by the same singer A from another song program B, the information pertinent to a beginning 
point (IN-point) and an end point (OUT-point) of the playback domain is input to the controller 23 from the terminal as 
a user interface. The controller 23 creates a database of the group (PlayList) of playback domains (Playltem) of the 
55 AV streams. 

[0052] When the user desires to erase a portion of the AV stream recorded on the recording medium 1 00, the infor- 
mation pertinent to the IN-point and the OUT-point of the erasure domain is input to the controller 23, which then 
modifies the database of the PlayList so as to refer to only the needed AV streams. The controller 23 also commands 



8 



EP 1 198 133 A1 

the write unit 22 to erase an unneeded stream portion of the AV stream. 

[0053] The case in which the user desires to specify playback domains of an AV stream recorded on the recording 
medium to create a new playback route, and to interconnect the respective playback domains in a seamless fashion, 
is now explained. In such case, the controller 23 creates a database of a group (PlayList) of the playback domains 
5 (Playltem) of the AV stream and undertakes to partially re-encode and re-multiplex the video stream in the vicinity of 
junction points of the playback domains. 

[0054] The picture information at the IN-point and that at the OUT-point of a playback domain are input from a terminal 
24 to a controller 23. The controller 23 commands the readout unit 28 to read out data needed to reproduce the pictures 
at the IN-point and at the OUT-point. The readout unit 28 reads out data from the recording medium 100. The data so 
10 read out is output through the demodulating unit 29, ECC decoder 30 and the source packetizer 1 9 to the demultiplexer 
26. 

[0055] The controller 23 analyzes data input to the demultiplexer 26 to determine the re-encoding method for the 
video stream (change of picture_coding_type and assignment of the quantity of encoding bits for re-encoding) and the 
re-multiplexing system to send the system to the AV encoder 15 and to the multiplexer 16. 

15 [0056] The demultiplexer 26 then separates the input stream into the video stream (V), audio stream (A) and the 
system information (S). The video stream may be classed into data input to the audio decoder 27 and data input to 
the multiplexer 16. The former is data needed for re-encoding, and is decoded by the audio decoder 27, with the 
decoded picture being then re-encoded by the AV encoder 15 and thereby caused to become a video stream. The 
latter data is data copied from an original stream without re-encoding. The audio stream and the system information 

20 are directly input to the multiplexer 16. 

[0057] The multiplexer 1 6 multiplexes an input stream, based on the information input from the controller 23, to output 
a multiplexed stream, which is processed by the ECC unit 20 and the modulation unit 21 so as to be sent to the write 
unit 22. The write unit 22 records an AV stream on the recording medium 100 based on the control signals supplied 
from the controller 23. 

25 [0058] The application database information and the operation based on this information, such as playback and 
editing, are hereinafter explained. Fig. 2 shows the structure of an application format having two layers, that is PlayList 
and Clip, for AV stream management. The Volume Information manages all Clips and PlayLists in the disc. Here, one 
AV stream and the ancillary information thereof, paired together, is deemed to be an object, and is termed Clip. The 
AV stream file is termed a Clip AV stream file, with the ancillary information being termed the Clip Information file. 

30 [0059] One Clip AV stream file stores data corresponding to an MPEG-2 transport stream arranged in a structure 
prescribed by the application format. By and large, a file is treated as a byte string. The contents of the Clip AV stream 
file are expanded on the time axis, with entry points in the Clip (l-picture) being mainly specified on the time basis. 
When a time stamp of an access point to a preset Clip is given, the Clip Information file is useful in finding the address 
information at which to start data readout in the Clip AV stream file. 

35 [0060] Referring to Fig. 3, PlayList is now explained, which is provided for a user to select a playback domain desired 
to be viewed from the Clip and to edit the playback domain readily. One PlayList is a set of playback domains in the 
Clip. One playback domain in a preset Clip is termed Playltem and is represented by a pair of an IN-point and an OUT- 
point on the time axis. So, the PlayList is formed by a set of plural Playltems. 

[0061 ] The PlayList is classified into two types, one of which is Real PlayList and the other of which is Virtual PlayList. 
40 The Real PlayList co-owns stream portions of the Clip it is referencing. That is, the Real PlayList takes up in the disc 
the data capacity corresponding to a stream portion of the Clip it is referencing and, when Real PlayList is erased, the 
data of the stream portion of the Clip it is referencing is also erased. 

[0062] The Virtual PlayList is not co-owning Clip data. Therefore, if the Virtual PlayList is changed or erased, the 

contents of the Clip are in no way changed. 
45 [0063] The editing of the Real Playlist is explained. Fig.4A shows creation of Real PlayList and, if the AV stream is 

recorded as a new Clip, the Real PlayList which references the entire Clip is a newly created operation. 

[0064] Fig.4B shows the division of the real PlayList, that is the operation of dividing the Real PlayList at a desired 

point to split the Real PlayList in two Real PlayLists, This division operation is performed when two programs are 

managed in one clip managed by a sole PlayList and when the user intends to re-register or re-record the programs 
so as separate individual programs. This operation does not lead to alteration of the Clip contents, that is to division of 

the Clip itself. 

[0065] Fig.4C shows the combining operation of the Real PlayList which is the operation of combining two Real 
PlayLists into one new Real PlayList. This combining operation is performed such as when the user desires to re- 
register two programs as a sole program. This operation does not lead to alteration of the Clip contents, that is to 
55 combining the clip itself into one. 

[0066] Fig.5A shows deletion of the entire Real PlayList. If the operation of erasing the entire preset Real PlayList, 

the associated stream portion of the Clip referenced by the deleted Real PlayList is also deleted. 

[0067] Fig.5B shows partial deletion of the Real PlayList. If a desired portion of the Real PlayList is deleted, the 
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associated Play Item is altered to reference only the needed Clip stream portion. The corresponding stream portion of 
the Clip is deleted. 

[0068] Fig.5C shows the minimizing of the Real PlayList. It is an operation of causing the Playltem associated with 
the Real PlayList to reference only the stream portion of the Clip needed for Virtual PlayList. The corresponding stream 
5 portion of the Clip not needed for the Virtual PlayList is deleted. 

[0069] If the Real PlayList is changed by the above-described operation such that the stream portion of the Clip 
referenced by the Real PlayList is deleted, there is a possibility that the Virtual PlayList employing the deleted Clip is 
present such that problems may be produced in the Virtual PlayList due to the deleted Clip. 

[0070] In order to prevent this from occurring, such a message which runs: "If there exists the Virtual Piay List ref- 
10 erencing the stream portion of the Clip the Real PlayList is referencing, and the Real PlayList is deleted, the Virtual 

PlayList itself is deleted - is it all right?" is displayed for the user in response to the user's operation of deletion by way 

of confirmation or alarming, after which the processing for deletion is executed or cancelled subject to user's commands. 

Alternatively, the minimizing operation for the Real PlayList is performed in place of deleting the Virtual PlayList. 

[0071] The operation for the Virtual PlayList is now explained. If an operation is performed for the Virtual PlayList, 
15 the contents of the Clip are not changed. Figs.6A and 6B show the assembling and editing (IN-OUT editing). It is an 

operation of creating Playltem of the playback domain the user has desired to view to create Virtual PlayList. The 

seamless connection between Playltems is supported by the application format, as later explained. 

[0072] If there exist two Real PlayLists 1 , 2 and clips 1 , 2 associated with the respective Real PlayLists, the user 

specifies a preset domain in the Real PlayList 1 (domain from IN1 to OUT1 : Playltem 1) as the playback domain, and 
20 also specifies, as the domain to be reproduced next, a preset domain in the Real PlayList 2 (domain from IN2 to OUT2: 

Playltem 2) as the playback domain, as shown in Fig. 6A, a sole Virtual PlayList made up of Playltem 1 and the Playltem2 

is prepared, as shown in Fig.6B. 

[0073] The re-editing of the Virtual PlayList is now explained. The re-editing may be enumerated by alteration of IN- 
or OUT points in the Virtual PlayList, insertion or appendage of new Playltems to the Virtual PlayList and deletion of 

25 Playltems in the Virtual PlayList. The Virtual PlayList itself may also be deleted. 

[0074] Fig.7 shows the audio dubbing (post recording) to the Virtual PlayList. It is an operation of registering the 
audio post recording to the Virtual PlayList as a sub path. This audio post recording is supported by the application 
software. An additional audio stream is added as a sub path to the AV stream of the main path of the Virtual PlayList. 
[0075] Common to the Real PlayList and the Virtual PlayList is an operation of changing (moving) the playback 

30 sequence of the PlayList shown in Fig. 8. This operation is an alteration of the playback sequence of the PlayList in the 
disc (volume) and is supported by TableOfPlayList as defined in the application format, as will be explained subse- 
quently with reference to e.g., Fig. 20. This operation does not lead to alteration of the Clip contents. 
[0076] The mark (Mark) is now explained. The mark is provided for specifying a highlight or characteristic time in the 
Clip and in the PlayList, as shown in Fig. 9. The mark added to the Clip is termed the ClipMark. The ClipMark is e.g., 

35 a program indexing point or a scene change point for specifying a characteristic scene ascribable to contents in the 
AV stream. The ClipMark is generated by e.g., the analysis unit 14 of Fig.1 . When the PlayList is reproduced, the mark 
of the Clip referenced by the PlayList may be referenced and used. 

[0077] The mark appended to the PlayList is termed the PlayListMark (play list mark). The PlayListMark is e.g., a 
bookmark point or a resuming point as set by the user. The setting of the mark to the Clip and to the PlayList is by 
40 adding a time stamp indicating the mark time point to the mark list. On the other hand, mark deletion is removing the 
time stamp of the mark from the mark list. Consequently, the AV stream is in no way changed by mark setting or by 
mark deletion. 

[0078] As another format of the ClipMark, a picture referenced by the ClipMark may be specified on the address 
basis in the AV stream. Mark setting on the Clip is by adding the address basis information indicating the picture of the 
45 mark point to the mark list. On the other hand, mark deletion is removing the address basis information indicating the 
mark point picture from the mark list. Consequently, the AV stream is in no way changed by mark setting or by mark 
deletion. 

[0079] A thumbnail is now explained. The thumbnail is a still picture added to the Volume, PlayList and Clip. There 
are two sorts of the thumbnail, one of them being a thumbnail as a representative picture indicating the contents. This 
so is mainly used in a main picture in order for the user to select what the or she desired to view on acting on a cursor, 
not shown. Another sort of the thumbnail is a picture indicating a scene pointed by the mark. 

[0080] The Volume and the respective PlayLists need to own representative pictures. The representative pictures 
of the Volume are presupposed to be used for initially demonstrating a still picture representing the disc contents when 
the disc is set in position in the recording and/or reproducing apparatus 1 . It is noted that the disc means the recording 
55 medium 100 which is presupposed to be a of disc shape. The representative picture of the PlayList is presupposed to 
be used as a still picture for representing PlayList contents. 

[0081] As the representative picture of the PlayList, it may be contemplated to use the initial picture of the PlayList 
as the thumbnail (representative picture). However, the leading picture at the playback time of 0 is not necessarily an 
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optimum picture representing the contents. So, the user is allowed to set an optional picture as a thumbnail of the 
PlayList. Two sorts of the thumbnails, that is the thumbnail as a representative picture indicating the Volume and the 
thumbnail as a representative picture indicating PlayList, are termed menu thumbnails. Since the menu thumbnails 
are demonstrated frequently, these thumbnails need to be read out at an elevated speed from the disc. Thus, it is 
5 efficient to store the totality of the menu thumbnails in a sole file. It is unnecessary for the menu thumbnails to be 
pictures extracted from the moving pictures in the volume, but may be a picture captured from a personal computer or 
a digital still camera, as shown in Fig. 10. 

[0082] On the other hand, the Clip and the PlayList need be marked with plural marks, whilst the pictures of the mark 
points need to be readily viewed in order to grasp the contents of the mark positions. The picture indicating such mark 
10 point is termed a mark thumbnail. Therefore, the picture which is the original of the mark thumbnail is mainly an extracted 
mark point picture rather than a picture captured from outside. 

[0083] Fig. 11 shows the relation between the mark affixed to the PlayList and the mark thumbnail, whilst Fig. 12 
shows the relation between the mark affixed to the Clip and the mark thumbnail. In distinction from the menu thumbnail, 
the mark thumbnail is used in e.g., a sub-menu for representing details of the PlayList, while it is not requested to be 
15 read out in a short access time. So, whenever a thumbnail is required, the recording and/or reproducing apparatus 1 
opens a file and reads out a portion of the file, while there is no problem presented even if file opening and reading 
out a portion of the file by the recording and/or reproducing apparatus 1 takes some time. 

[0084] For decreasing the number of files present in a volume, it is preferred for the totality of the mark thumbnails 
to be stored in one file. While the PlayList may own one menu thumbnail and plural mark thumbnails, the user is not 
20 required to select the Clip directly (usually, the Clip is selected through PlayList), and hence there is no necessity of 
providing menu thumbnails. 

[0085] Fig. 13 shows the relation between the menu thumbnails, mark thumbnails, PlayList and Clips. In the menu 
thumbnail file are filed menu thumbnails provided from one PlayList to another. In the menu thumbnail file is contained 
a volume thumbnail representing the contents of data recorded on the disc. In the menu thumbnail file are filed thumb- 

25 nails created from one PlayList to another and from one Clip to another. 

[0086] The CPI (Characteristic Point Information) is hereinafter explained. The CPI is data contained in the Clip 
information file and is used mainly for finding a data address in the Clip AV stream file at which to start the data readout 
when a time stamp of the access point to the Clip is afforded. In the present embodiment two sorts of the CPI are used, 
one of them being EP_map and the other being TU_map. 

30 [0087] The EP_map is a list of entry point (EP) data extracted from the elementary stream and the transport stream. 
This has the address information used to find the site of entry points in the AV stream at which to start the decoding. 
One EP data is made up of a presentation time stamp (PTS) and a data address in the AV stream of the accessing 
unit associated with the PTS, with the data address being paired to the PTS. 

[0088] The EPjnap is used mainly for two purposes. First, it is used for finding a data address in the AV stream in 
35 the accessing unit referenced by the PTS in the PlayList. Second, the EP_map is used for fast forward playback or 
fast reverse playback. If, in recording the input AV stream by the recording and/or reproducing apparatus 1 , the syntax 
of the stream can be analyzed, the EP_map is created and recorded on the disc. 

[0089] The TU_map has a list of time unit (TU) data which is derived from the arrival time point of the transport packet 
input through a digital interface. This affords the relation between the arrival-time-based time and the data address in 
40 the AV stream. When the recording and/or reproducing apparatus 1 records an input AV stream, and the syntax of the 
stream cannot be analyzed, a TILmap is created and recorded on the disc. 

[0090] The STCInfo stores the discontinuous point information in the AV stream file which stores the MPEG-2 trans- 
port stream. 

[0091] When the AV stream has discontinuous points of STC, the same PTS values may appear in the AV stream 
45 file. Thus, if a time point in the AV stream is specified on the PTS basis, the PTS pf the access point is insufficient to 
specify the point. Moreover, an index of the continuous STC domain containing the PTS is required. In this format, the 
continuous STC domain and its index are termed an STC-sequence and STC-sequence-id, respectively. The STC- 
sequence information is defined by the STCInfo of the Clip Information file. 

[0092] The STC-sequence-id is used in an AV stream file and is optional in the AV stream file having the TLLmap. 
so [0093] The programs are each a collection of elementary streams and co-own a sole system time base for synchro- 
nized reproduction of these streams. 

[0094] It is useful for a reproducing apparatus (recording and/or reproducing apparatus 1 of Fig. 1) to know the 
contents of an AV stream prior to its decoding. These contents include e.g., values of the PID of a transport packet 
transmitting an audio or video elementary stream or the type of the video or audio components, such as HDTV video 
55 or MPEG-2 AAC audio stream. This information is useful for creating a menu screen toy illustrating to the user the 
contents of the PlayList referencing the AV stream. It is similarly useful for setting the initial state of the AV decoder 
and the demultiplexer of the respective apparatus. 

[0095] For this reason, the Clip Information file owns Programlnfo for illustrating the program contents. 
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[0096] It may be an occurrence that program contents be changed in the AV stream file in which the MPEG-2 transport 
stream is stored. For example, the PID of the transport packet transmitting the video elementary stream may be 
changed, or the component type of the video stream may be changed from SDTV to HDTV. 

[0097] The Programlnfo stores the information on change points of program contents in the AV stream file. The 
5 domain of the AV stream file in which the program contents remain constant is termed program-sequence. 

[0098] This program-sequence is used in an AV stream file having EP_map and is optional in an AV stream file 
having TU_map. 

[0099] The present embodiment defines the self-encoding stream format (SESF). The SESF is used for encoding 
analog input signals and for decoding digital input signals for subsequently encoding the decoded signal into an MPEG- 
10 2 transport stream. 

[0100] The SESF defines an elementary stream pertinent to the MPEG-2 transport stream and the AV stream. When 
the recording and/or reproducing apparatus 1 encodes and records input signals in the SESF, an EPjnap is created 
and recorded on the disc. 

[0101] A digital broadcast stream uses one of the following systems for recording on the recording medium 100: 
'5 First, the digital broadcast stream is transcoded into an SESF stream. In this case, the recorded stream must conform 
to SESF and EP_map must be prepared and recorded on the disc. 

[0102] Alternatively, an elementary stream forming a digital broadcast stream is transcoded to a new elementary 
stream and re-multiplexed to a new transport stream conforming to the stream format prescribed by the organization 
for standardizing the digital broadcast stream. In this case, an EPjmap must be created and recorded on the disc. 
20 [0103] For example, it is assumed that the input stream is an MPEG-2 transport stream conforming to the ISDB 
(standard appellation of digital BS of Japan), with the transport stream containing the HDTV video stream and the 
MPEG AAC audio stream. The HDTV video stream is transcoded to an SDTV video stream, which SDTV video stream 
and the original AAC audio stream are re-multiplexed to TS. The SDTV stream and the transport stream both need to 
conform to the ISDB format. 

25 [01 04] Another system of recording the digital broadcast stream on the recording medium 1 00 is to make transparent 
recording of the input transport stream, that is to record the input transport stream unchanged, in which case the 
EP_map is formulated and recorded on the disc. 

[01 05] Alternatively, the input transport stream is recorded transparently, that is an input transport stream is recorded 
unchanged, in which case TU__map is created and recorded on the disc. 

30 [0106] The directory and the file are hereinafter explained. The recording and/or reproducing apparatus 1 is herein- 
after described as DVR (digital video recording). Fig. 14 shows a typical directory structure on the disc. The directories 
of the disc of the DVR may be enumerated by a root directory including "DVR" directory, and the "DVR" directory, 
including "PLAY LI ST" directory, "CLIPINF" directory, "M2TS" directory and "DATA" directory, as shown in Fig. 14. Al- 
though directories other than these may be created below the root directory, these are discounted in the application 

35 format of the present embodiment. 

[01 07] Below the "DATA" directory, there are stored all files and directories prescribed by the DVR application format. 
The "DVR" directory includes four directories. Below the "PLAYLIST" directory are placed Real PlayList and Virtual 
PlayList database files. The latter directory may exist in a state devoid of PlayList. 

[0108] Below "CLIPINF" is placed a Clip database. This directory, too, may exist in a state devoid of AV stream files. 

40 in the "DATA" directory, there are stored files of data broadcast, such as digital TV broadcast. 

[01 09] The "DVR" directory stores the following files: That is, an "info.dvr" is created below the DVR directory to store 
the comprehensive information of an application layer. Below the DVR directory, there must be a sole info.dvr. The 
filename is assumed to be fixed to info.dvr. The "menu.thmb" stores the information pertinent to the menu thumbnails. 
Below the DVR directory, there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". 

45 |f there is no menu thumbnail, this file may not exist. 

[0110] The "mark.thmb" file stores the information pertinent to the mark thumbnail picture. Below the DVR directory, 
there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". If there is no menu thumbnail, 
this file may not exist. 

[0111] The "PLAYLIST" directory stores two sorts of the PlayList files which are Real PlayList and Virtual PlayList. 
50 An "xxxxx.rpls" file stores the information pertinent to one Real PlayList. One file is created for each Real PlayList. The 
filename is "xxxxx.rpls", where "xxxxx" denotes five numerical figures from 0 to 9. A file extender must be "rpls". 
[0112] The "yyyyy.vpls" stores the information pertinent to one Virtual PlayList. One file with a filename "yyyyy.vpls" 
is created from one Virtual PlayList to another, where "yyyyy" denotes five numerical figures from 0 to 9. A file extender 
must be "vpls". 

55 [0113] The "CLIPINF" directory stores one file in association with each AV stream file. The "zzzzz.clpi" is a Clip 
Information file corresponding to one AV stream file (Clip AV stream file or Bridge-Clip stream file). The filename is 
"zzzzz.clpi", where "zzzzz" denotes five numerical figures from 0 to 9. A file extender must be "dpi". 
[01 14] The "M2TS" directory stores an AV stream file. The "zzzzz.m2ts" file is an AV stream file handled by the DVR 
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system. This is a Clip AV stream file or a Bridge-Clip AV stream file. The filename is "zzzzz.m2ts", where "zzzzz" 
denotes five numerical figures from 0 to 9. A file extender must be "m2ts". 

[0115] The "DATA" directory stores data transmitted from data broadcasting. This data may, for example, be XML 
or MPEG files. 

5 [01 16] The syntax and the semantics of each directory (file) are now explained. Fig.1 5 shows the syntax of the "info, 
dvr" file. The "info.dvr" file is made up of three objects, that is DVRVoume(), TableOf PlayLists() and Makers Private Data 
()• 

[0117] The syntax of info.dvr shown in Fig. 15 is explained. The Tab I eOfPlayLists_Start_address indicates the leading 
address of the TableOfPlayLists() in terms of the relative number of bytes from the leading byte of the "info.dvr" file. 

10 The relative number of bytes is counted beginning from 0. 

[0118] The MakersPrivateData_Start_address indicates the leading address of the MakersPrivateDataQ, in terms 
of the relative number of bytes as from the leading byte of the "info.dvr" file. The relative number of bytes is counted 
from 0. The padding_word is inserted in association with the syntax of "info.dvr". N1 and N2 are optional positive 
integers. Each padding word may assume an optional value. 

'5 [0119] The DVRVolumeQ stores the information stating the contents of the volume (disc). Fig. 16 shows the syntax 
of the DVRVolume. The syntax of the DVRVolume(), shown in Fig. 1 6, is now explained. The version_number indicates 
four character letters indicting the version numbers of the DVRVolume(). The version_number is encoded to "0045" in 
association with IS0646. 

[0120] Length is denoted by 32-bit unsigned integers indicating the number of bytes from directly after the length 
20 field to the trailing end of DVRVolumeQ. 

[0121] The ResumeVoIume() memorizes the filename of the Real PlayList or the Virtual PlayList reproduced last in 
the Volume. However, the playback position when the user has interrupted playback of the Real PlayList or the Virtual 
PlayList is stored in the resume-mark defined in the PlayListMark() (see Figs.42 and 43). 

[0122] Fig. 17 shows the syntax of the ResumeVolumeQ. The syntax of the ResumeVolumeQ shown in Fig. 17 is 
25 explained. The valid Jlag indicates that the resume_PlayList_name field is valid or invalid when this 1 -bit flag is set to 
1 or 0, respectively. 

[01 23] The 1 0-byte field of resume_PlayList_name indicates the filename of the Real PlayList or the Virtual PlayList 
to be resumed. 

[0124] The UlApplnfoVolume in the syntax of the DVRVolume(), shown in Fig. 16, stores parameters of the user 
30 interface application concerning the Volume. Fig.1 8 shows the syntax of the UlApplnfoVolume, the semantics of which 
are now explained. The 8-bit field of character_set indicates the encoding method for character letters encoded in the 
Volume_name field. The encoding method corresponds to the values shown in Fig.1 9. 

[01 25] The 8-bit field of the namejength indicates the byte length of the Volume name indicated in the Volume_name 
field. The Volume_name field indicates the appellation of the Volume. The number of bytes of the number of the 
35 namejength counted from left of the field is the number of valid characters and indicates the volume appellation. The 
values next following these valid character letters may be any values. 

[0126] The Volume_protect_flag is a flag indicating whether or not the contents in the Volume can be shown to the 
user without limitations. If this flag is set to 1 , the contents of the Volume are allowed to be presented (reproduced) to 
the user only in case the user has succeeded in correctly inputting the PIN number (password). If this flag is set to 0, 

40 the contents of the Volume are allowed to be presented to the user even in case the PIN number is not input by the user. 
[0127] If, when the user has inserted a disc into a player, this flag has been set to 0, or the flag is set to 1 but the 
user has succeeded in correctly inputting the PIN number, the recording and/or reproducing apparatus 1 demonstrates 
a list of the PlayList in the disc. The limitations on reproduction of the respective PlayLists are irrelevant to the 
Volume_protect_flag and are indicated by playback_control_flag defined in the UlApplnfoVolume. 

45 [0128] The PIN is made up of four numerical figures of from 0 to 9, each of which is coded in accordance with ISO/ 
I EC 646. The ref_thumbnail_index field indicates the information of a thumbnail picture added to the Volume. If the 
ref_thumbnail_index field is of a value other than OxFFFF, a thumbnail picture is added to the Volume. The thumbnail 
picture is stored in a menu. thumb file. The picture is referenced using the value of the ref_thumbnail_index in the menu, 
thumb file. If the reMhumbnailjndex field is OxFFFF, it indicates that a thumbnail picture has been added to the Volume. 

50 [0129] The TableOfPlayList() in the info.dvr syntax shown in Fig. 15 is explained. The TableOf PlayList() stores the 
filename of the PlayList (Real PlayList and Virtual PlayList). All PlayList files recorded in the Volume are contained in 
the TableOfPlayList(), which TableOfPlayList() indicates the playback sequence of the default of the PlayList in the 
Volume. 

[0130] Fig.20 shows the syntax of the TableOf PlayListQ, which is now explained. The version_number of the Table- 
55 OfPlayList() indicates four character letters indicating the version numbers of the TableOf PlayLists. The 
version_number must be encoded to M 0045" in accordance with ISO 646. 

[0131] Length is a unsigned 32-bit integer indicating the number of bytes of the TableOf PlayList() from directly after 
the length field to the trailing end of the TableOfPlayListQ. The 16-bit field of the number_of_PlayLists indicates the 
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number of loops of the for-loop inclusive of the PlayList Jile_name. This numerical figure must be equal to the number 
of PlayLists recorded in the Volume. The 10-byte numerical figure of the Playl_istjile_name indicates the filename of 
the PlayLists. 

[0132] Fig. 21 shows another configuration ofthe syntax of the TableOfPlayListQ. The syntax shown in Fig.21 iscom- 
5 prised of the syntax shown in Fig. 20 in which is contained the UlApplnfoPlayList. By such structure including the 
UlApplnfoPlayList, it becomes possible to create a menu picture simply on reading out the TableOf PlayLists. The 
following explanation is premised on the use of the syntax shown in Fig.20. 

[0133] The MakersPrivateData in the info.dvr shown in Fig. 15 is explained. The MakersPrivateData is provided to 
permit the maker of the recording and/or reproducing apparatus 1 to insert private data of the maker in the MakersPri- 

10 vateData() for special applications of different companies. The private data of each maker has standardized makerJD 
for identifying the maker who has defined it. The Makers PrivateData() may contain one or more makerJD. 
[0134] If a preset maker intends to insert private data, and the private data of a different maker is already contained 
in the MakersPrivateData(), the new private data is added to the MakersPrivateData() without erasing the pre-existing 
old private data. Thus, in the present embodiment, private data of plural makers can be contained in one MakersPri- 

« vateData(). 

[0135] Fig. 22 shows the syntax of the MakersPrivateData. The syntax of the MakersPrivateData shown in Fig.22 is 
explained. The version_number of the TableOf PlayListQ indicates four character letters indicating the version numbers 
of the TableOf PlayLists. The version_n umber must be encoded to "0045" in accordance with ISO 646. Length is a 
unsigned 32-bit integer indicating the number of bytes of the TableOf PlayList() from directly after the length field to the 

20 trailing end of the MakersPrivateData(). 

[0136] The mpdJ)locks_start_address indicates the leading end address of the first mpd_block() in terms of the 
relative number of bytes from the leading byte of the MakersPrivateDataQ. The number_of_maker_entries is the 16-bit 
codeless integer affording the number of entries of the maker private data included in the MakersPrivateData(). There 
must not be present two or more maker private data having the same makerJD values in the MakersPrivateData(). 

25 [0137] The mpd_blocks_size is a 16-bit unsigned integer affording one mpd_block size in terms of 1024 bytes as a 
unit. For example, if mpd_block_size = 1, it indicates that the size of one mpd_block is 1024 bytes. The 
number_of_mpd_blocks is a 16-bit unsigned integer affording the number of mpd_blocks contained in the MakersPri- 
vateData(). The makerJD is the 1 6-bit unsigned integer indicating the model number code of the DVR system which 
has created the maker private data. The value encoded to the makerJD is specified by the licensor. 

30 [0138] The maker_model_code is a 16-bit unsigned integer indicating the model number code of the DVR system 
which has created the maker private data. The value encoded to the maker_model_code is set by the maker who has 
received the license of the format. The start_mpd_block_n umber is a 1 6-bit unsigned integer indicating the number of 
the mpd_block number at which begins the maker private data. The leading end of the maker private data must be 
aligned with the leading end of the mpdJ>lock. The start_mpd_block_number corresponds to a variable j in the for- 

35 loop of the mpd_block. 

[0139] The mpdjength is a 32-bit unsigned integer indicating the size of the maker private data. The mpdjalock is 
an area in which is stored maker private data. All of the mpd J>locks in the MakersPrivateData() must be of the same 
size. 

[0140] The real PlayList file and the Virtual PlayList file, in other words, xxxxx.rpls and yyyyy.vpls, are explained. 
40 Fig.23 shows the syntax of xxxxx.rpls (Real PlayList) and yyyyy.vpls (Virtual PlayList), which are of the same syntax 
structure. Each of the xxxxx.rpls and yyyyy.vpls is made up of three objects, that is PlayListQ, PlayListMarkQ and 
MakersPrivateData(). 

[0141] The PlayListMark_Start_address indicates the leading address of the PlayListMark(), in terms of the relative 
number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted from zero. 
45 [0142] The Makers PrivateData_Start_address indicates the leading address of the Makers PrivateDataf), in terms 
of the relative number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted 
from zero. 

[0143] The padding_word (padding word) is inserted in accordance with the syntax of the PlayList file, with N1 and 
N2 being optional positive integers. Each padding word may assume an optional value. 
so [0144] PlayList will be further explained in the following although it has been explained briefly. A playback domain 
in all Clips except Bridge-Clip must be referred by all PlayLists in the disc. Also, two or more Real PlayLists must not 
overlap the playback domains shown by their Playltems in the same Clip. 

[0145] Reference is made to Figs.24A, 24B and 24C. For all Clips, there exist corresponding Real PlayLists, as 
shown in Fig.24A. This rule is observed even after the editing operation has come to a close, as shown in Fig.24B. 
55 Therefore, all Clips must be viewed by referencing one of Real PlayLists. 

[0146] Referring to Fig.24C, the playback domain of the Virtual PlayList must be contained in the playback domain 
and in the Bridge-Clip playback domain. There must not be present in the disc Bridge-Clip not referenced by any Virtual 
PlayList. 
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[0147] The Real PlayList, containing the list of the Play Item, must not contain SubPlayltem. The Virtual PlayList 
contains the Playltem list and, if the CPIJype contained in the PlayList() is the EPjnap type and the PlayList Jype is 
0 (PlayList containing video and audio) , the Virtual PlayList may contain one SubPlayltem. In the PlayList() in the 
present embodiment, the SubPlayltem is used only for audio post recording. The number of the SubPlayltems owned 

5 by one Virtual PlayList must be 0 or 1 . 

[0148] The PlayList is hereinafter explained. Fig.25 shows the PlayList syntax which is now explained. The 
version_number indicates four character letters indicting the version numbers of the PlayList(). The version_number 
is encoded to "0045" in association with ISO 646. Length is a 32-bit unsigned integer indicating the total number of 
byte of the PlayList() as from directly after the length field to the trailing end of the PlayListQ. The PlayList_type, one 

10 example of which is shown in Fig.26, is an 8-bit field indicating the PlayList type. 

[01 49] The CPI_type is one-bit flag indicating the value of the CPIJype of the Clip referenced by the Playltem() and 
by the SubPlayltem(). The CPLtypes defined in the CPIs of all Clips referenced by one PlayList must be of the same 
values. The number_of_Playltems is a 16-bit field indicating the number of Play Items present in the PlayList. 
[0150] The Playltemjd corresponding to the preset PlayltemQ is defined by the sequence in which the Playltem() 

*5 appears in the for-loop containing the Playltem(). The Playltemjd begins with 0. The nimberj>f_Sub Play Items is a 
16-bit field indicating the number of SubPlayltem in the PlayList. This value is 0 or 1 . An additional audio stream path 
(audio stream path) is a sort of a sub path. 

[0151] The UlApplnfoPlayList of the PlayList syntax shown in Fig.25 is explained. The UlApplnfoPlayList stores 
parameters of the user interface application concerning the PlayList. Fig.27 shows the syntax of the UlApplnfoPlayList, 

20 which is now explained. The character_set is an 8-bit field indicating the method for encoding character letters encoded 
in the PlayList_name field. The encoding method corresponds to the values conforming to the table shown in Fig. 19. 
[01 52] The namejength is an 8-bit field indicating the byte length of the PlayList name indicated in the PlayList_name 
field. The PlayList_name field shows the appellation of the PlayList. The number of bytes of the number of the 
namejength counted from left of the field is the number of valid characters and indicates the PlayList appellation. The 

25 values next following these valid character letters may be any values. 

[01 53] The recordJime_and_date is a 56-bit field storing the date and time on which the PlayList was recorded. This 
field is 14 numerical figures for year/ month/ day/ hour/minute/ second encoded in binary coded decimal (BCD). For 
example, 2001/12/23:01:02:03 is encoded to "0x20011223010203". 

[0154] The duration is a 24-bit field indicating the total replay time of the PlayList in terms of hour/ minute/ second 
30 as a unit. This field is six numerical figures encoded in binary coded decimal (BCD). For example, 01 :45:30 is encoded 
to "0x01 4530". 

[0155] The valid jseriod is a 32-bit field indicating the valid time periods of the PlayList. This field is 8 numerical 
figures encoded in 4-bit binary coded decimal (BCD). The valid_period is used in the recording and/or reproducing 
apparatus 1 e.g., when the PlayList, for which the valid period has lapsed, is to be automatically erased. For example, 

35 2001/05/07 is encoded to "0x20010507". 

[0156] The makerJD is a 16-bit unsigned integer indicating the maker of the DVR player (recording and/or repro- 
ducing apparatus 1) which has been the last to update its PlayList. The value encoded to makerJD is assigned to the 
licensor of the DVR format. The maker_code is a 16-bit unsigned integer indicating the model number of the DVR 
player which has been the last to update the PlayList. The value encoded to the maker_code is determined by the 

40 maker who has received the license of the DVR format. 

[0157] If the flag of the playback_contro1Jlag is set to 1 , its PlayList is reproduced only when the user successfully 
entered the PIN number. If this flag is set to 0, the user may view the PlayList without the necessity of inputting the 
PIN number. 

[0158] If the write_protectJlag is set to 1, the contents of the PlayList are not erased nor changed except the 
45 write_protectJlag. If this flag is set to 0, the user is free to erase or change the PlayList. If this flag is set to 1 , the 
recording and/or reproducing apparatus 1 demonstrates a message requesting re-confirmation by the user before the 
user proceeds to erase, edit or overwrite the PlayList. 

[0159] The Real PlayList, in which the write jDrotectJIag is set to 0, may exist, the Virtual PlayList, referencing the 
Clip of the Real PlayList may exist, and the write j>rotectJlag of the Virtual PlayList may be set to 1 . If the user is 
50 desirous to erase the Real PlayList, the recording and/or reproducing apparatus 1 issues an alarm to the user as to 
the presence of the aforementioned Virtual PlayList or "minimizes" the Real PlayList before erasing the Real PlayList. 
[0160] If isjDlayedJIag is set to 1 , as shown in Fig.28B, it indicates that the PlayList was reproduced at least once 
since it was recorded, whereas, if it is set to 0, it indicates that the PlayList was not reproduced even once since it was 
recorded. 

55 [0161] Archive is a two-bit field indicating whether the PlayList is an original or a copy, as shown in Fig.28C. The 
field of ref Jhumbnailjndex indicates the information of a thumbnail picture representative of the PlayList. If the 
ref Jhumbnailjndex field is of a value other than OxFFFF, a thumbnail picture representative of the PlayList is added 
in the PlayList, with the PlayList being stored in the menu.thmb file. The picture is referenced using the value of 
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ref Jhumbnailjndex in the menu.thmb file. If the ref Jhumbnail_index field is OxFFFF, no thumbnail picture represent- 
ative of the PlayList is added in the PlayList. 

[0162] The Playltem is hereinafter explained. One Playltem() basically contains the following data: 
ClipJnfoi*mation_file_name for specifying the filename of the Clip, IN-time and OUT-time, paired together to specify 
5 the playback domain of Clip, STC_sequence_id referenced by IN-time and OUT-time in case the CPIJype defined in 
PtayUst() is EP_map type, and Connection_Condition indicating the connection condition of previous Playltem and 
current Playltem. 

[0163] If PlayList is made up of two or more Playltems, these Playltems are arrayed in a row, without temporal gap 
or overlap, on the global time axis of the PlayList. If CPI_type defined in the PlayList is EP_map type and the current 
10 PlayList does not have the BridgeSequence(), the IN-time and OUT-time pair must indicate the same time on the STC 
continuous domain as that specified by the STC_sequence_id. Such instance is shown in Fig.29. 
[0164] Fig.30 shows such a case in which the CPI_type defined by PlayList() and, if the current Playltem has the 
BridgeSequence(), the rules as now explained are applied. The IN_time of the Playltem previous to the current Play- 
ltem, shown as INJimel , indicates the time in Bridge-Clip specified in the BridgeSequencelnfo() of the current Play- 
's item. This OUTjime must obey the encoding limitations which will be explained subsequently. 

[0165] The INJime of the current Playltem, shown as IN_time2, indicates the time in Bridge-Clip specified in the 
BridgeSequencelnfo() of tne current Playltem. This INJime also must obey the encoding limitations as later explained. 
The OUTjime of Playltem of the current Playltem, shown as OUTJime2, indicates the time on the STC continuous 
domain specified by STC_sequenbce_id of the current Playltem. 
20 [0166] If the CPIJype of PlayList() isTUjnap type, the INJime and OUTjime of Playltem, paired together, indicate 
the time on the same Clio AV stream, as shown in Fig.31 . 

[0167] The Playltem syntax is as shown in Fig.32. As to the syntax of the Playltem, shown in Fig. 32, the field of the 
Clipjnformation Jile_name indicates the filename of the Clip Information. The Clip_stream_type defined by the Cliplnfo 
() of this Clip Information file must indicate the Clip AV stream. 
25 [0168] The STC_sequenceJd is an 8-bit field and indicates the STC_sequence_id of the continuous STC domain 
referenced by the Playltem. If the CPIJype specified in the PlayListQ is TU_map type, this 8-bit field has no meaning 
and is set to 0. INJime is a 32-bit field and used to store the playback start time of Playltem. The semantics of INJime 
differs with CPIJype defined in the PlayList(), as shown in Fig. 33. 

[0169] OUTjime is a 32-bit field and is used to store the playback end time of Playltem. The semantics of OUTjime 
30 differs with CPIJype defined in the PlayList(), as shown in Fig. 34. 

[0170] Connection_condition is a 2-bit field indicating the connection condition between the previous Playltem and 
the current Playltem, as shown in Fig. 35. Figs.36A to 36D illustrate various states of Connection_condition shown in 
Fig.35. 

[0171] BridgeSequencelnfo is explained with reference to Fig.37. This BridgeSequencelnfo is the ancillary informa- 
35 tion of the current Playltem and includes the following information. That is, BridgeSequencelnfo includes 
Bridge_ClipJnformationJile_name for specifying the Bridge_Clip AV file and a Bridge_Clip_lnformationJile_name 
specifying the corresponding Clip Information file (Fig.45). 

[0172] It is also an address of a source packet on the Clip AV stream referenced by the previous Playltem. Next to 
this source packet is connected the first source packet of the Bridge-Clip AV stream. This address is termed the 
40 RSPN_exitJromj3revious_Clip. It is also an address of the source packet on the Clip AV stream referenced by the 
current Playltem. Ahead of this source packet is connected the last source packet of the Bridge_clip AV stream file. 
This address is termed RSPN_enterJo_current_Clip. 

[0173] In Fig.37 t RSPN_arrivalJime_discontinuity indicates an address of a source packet in the Bridge_Clip AV 
stream where there is a discontinuous point in the arrival time base. This address is defined in the Cliplnfo() (Fig.46). 
45 [0174] Fig. 38 shows the syntax of the BridgeSequencelnfo. Turning to the syntax of BridgeSequencelnfo shown in 
Fig. 38, the field of Bridge J^lip J nformat ion Jile_name indicates the filename of the Clip Information file corresponding 
to the Bridge_ClipJnformation Jile. The Clip_stream_type defined in Cliplnfo() of this Clip information file must indicate 
'Bridge_Clip AV stream*. 

[0175] The 32-bit field of the RSPN_exitJrom_previous_Clip is a relative address of a source packet on the Clip AV 
50 stream referenced by the previous Playltem. Next to this source packet is connected the first source packet of the 
Bridge_Clip AV stream file. The RSPN_exitJrom_previous_Clip has a size based on the source packet number as a 
unit, and is counted with the value of the offset_SPN defined in the Cliplnfo() from the first source packet of the Clip 
AV stream file referenced by the previous Playltem. 

[0176] The 32-bit field of RSPN_enterJo_curenLCIip is the relative address of the source packet on the Clip AV 
55 stream referenced by the current Playltem. Ahead of this source packet is connected the last source packet of the 
Bridge_Clip__AV stream file. The RSPN_enterJo_cu rent J2 lip has a size that is based on the source packet number 
as a unit. The RSPN_enterjo_curent_Clip is counted with the value of the offset_SPN, defined in the CliplnfoQ from 
the first source packet of the Clip AV stream file referenced by the current Playltem, as an initial value. 
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[0177] The SubPlayltem is explained with reference to Fig.39. The use of SubPlayltem() is permitted only if the 
CPI_type of the PlayList() is the EP_map type. In the present embodiment, SubPlayltem is used only for audio post 
recording. The SubPlayltemQ includes the following data. First, it includes Clipjnformation_file_name for specifying 
the Clip referenced by the sub path in the PlayList. 

s [0178] It also includes SubPathJfsMime and SubPath_OUT_time for specifying the sub path playback domain in 
the Clip. Additionally, it includes sync_Playltem_id and start_PTS_of_Playltem for specifying the time of starting the 
sub path reproduction on the main path time axis. The Clip AV stream, referenced by the sub path, must not contain 
STC discontinuous points (discontinuous points of the system time base). The clocks of audio samples of the Clip used 
in the sub path are locked to the clocks of the audio samples of the main path. 

to [0179] Fig.40 shows the syntax of the SubPlayltem. Turning to the syntax of the SubPlayltem, shown in Fig.40, the 
field of the ClipJnformation_file_name indicates the filename of the Clip Information file and is used by a sub path in 
the PlayList. The Clip_stream_type defined in this CliplnfoQ must indicate the Clip AV stream. 
[0180] An 8-bit field of sync_Playltemjd indicates the sub path type. Here, only '0x00' is set, as shown in Fig.41, 
while other values are reserved for future use. 

*s [01 81 ] The 8-bit field of sync_Play ltem_id indicates the Playltemjd of the Playltem containing the time of playback 
start of the sub path on the time axis of the main path. The value of Playltemjd corresponding to the preset Playltem 
is defined in the PlayList() (Fig.25). 

[0182] A 32-bit field of sync_start_PTS_of_Playltem denotes the time of playback start of the sub path on the time 
axis of the main path, and denotes the upper 32 bits of the PTS (presentation time stamp) on the Playltem referenced 
20 by the sync_Playltem_id. The upper 32 bit field of the SubPathJNJime stores the playback start time of the sub path . 
SubPathJN Jime denotes upper 32 bits of the PTS of 33 bits corresponding to the first presentation unit in the sub path . 
[0183] The upper 32 bitfield of subPath_OUT_time stores the playback end time of the sub path. SubPath_OUT_time 
indicates upper 32 bits of the value of the Presentation_end_TS calculated by the following equation: 

25 

Presentation_end_TS = PTSJDUT + AU_duration 

where PTS_out is the PTS of the 33 bit length corresponding to the last presentation unit of the SubPath and 
AU_duration is the 90 kHz based display period of the last presentation unit of the SubPath. 

30 [0184] Next, PlayListMark() in the syntax of xxxxx.rpls and yyyyy.vpls shown in Fig.23 is explained. The mark infor- 
mation pertinent to the PlayList is stored in this PlayListMark. Fig. 42 shows the syntax of PlayListMark. Turning to the 
syntax of the PlayListMark shown in Fig.42, version_number is four character letters indicating the version number of 
this PlayListMark(). The version_number must be encoded to "0045" in accordance with ISO 646. 
[0185] Length is an unsigned 32 -bit integer indicating the number of bytes of PlayListMark() from directly after the 

35 length field to the trailing end of the PlayListMark(). The number_of_PlayListMarks is a 1 6-bit unsigned integer indicating 
the number of marks stored in the PlayListMark. The number_of_PlayListMarks may be zero. The markjype is an 
8-bit field indicating the mark type and is encoded in the table shown in Fig.43. 

[01 86] A 32-bit filed of mark_time_stamp stores a time stamp indicating the point specified by the mark. The seman- 
tics of the mark_time_stamp differ with CPI_type defined in the PlayListQ, as shown in Fig. 44. The Playltemjd is an 
40 8-bit field specifying the Playltem where the mark is put. The values of Playltemjd corresponding to a preset Playltem 
is defined in the PlayListQ (see Fig.25). 

[0187] An 8-bit field of character_set shows the encoding method of character letters encoded in the mark_name 
field. The encoding method corresponds to values shown in Fig. 19. The 8-bit field of namejength indicates the byte 
length of the mark name shown in the mark_name field. The mark_name field denotes the mark name indicated in the 
45 mark_name field. The number of bytes corresponding to the number of namejengths from left of this field is the 
effective character letters and denotes the mark name. In the mark_name field, the value next following these effective 
character letters may be arbitrary. 

[0188] The field of the reMhumbnaiUndex denotes the information of the thumbnail picture added to the mark. If 
the field of the reMhumbnaiUndex is not OxFFFF, a thumbnail picture is added to its mark, with the thumbnail picture 
so being stored in the mark.thmb file. This picture is referenced in the mark.thmb file, using the value of 
reMhumbnaiUndex, as explained subsequently. If the reMhumbnaiUndex field is OxFFFF, it indicates that no thumb- 
nail picture is added to the mark. 

[0189] The Clip Information file is now explained. The zzzzz.clpi (Clip Information file) is made up of six objects, as 
shown in Fig.45. These are Cliplnfo(), STC Jnfo(), Program(), CPI(), ClipMarkf) and Makers PrivateData(). For the AV 
55 stream (Clip AV stream or Bridge-Clip AV stream) and the corresponding Clip Information file, the same string of nu- 
merals "zzzzz" is used. 

[0190] Turning to the syntax of zzzzz.clpi (Clip Information file) shown in Fig.45 is explained. The 
Cliplnfo_Start_address indicates the leading end address of CliplnfoQ with the relative number of bytes from the leading 
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end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 

[01 91 ] The STC_lnfo_Start_address indicates the leading end address of STCJnf o with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The Programlnfo_Start_address indicates the leading end 
address of Programlnfo() with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. 
5 The relative number of bytes is counted from 0. The CPI_Start_address indicates the leading end address of CPI() 
with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes 
is counted from zero. 

[01 92] The ClipMark_Start_address indicates the leading end address of ClipMark() with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 

10 TheJvlakersPrivateData Start_address indicates the leading end address of Makers PrivateData() with the relative 
number of bytes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from 
zero. The padding_word is inserted in accordance with the syntax of the zzzzz.clpi file. N1 , N2, N3, N4 and N5 must 
be zero or optional positive integers. The respective padding words may also assume optional values. 
[0193] The Cliplnfo is now explained. Fig.46 shows the syntax of Cliplnfo. Fig.46 shows the syntax of Cliplnfo. In 

15 the Cliplnfo() is stored the attribute information of corresponding AV stream files (Clip AV stream or Bridge-Clip AV 
stream file). 

[0194] Turning to the syntax of the Cliplnfo shown in Fig.46, version_number is the four character letters indicating 
the version number of this Cliplnfo(). The version_number must be encoded to "0045" in accordance with the ISO 646. 
Length is a 32-bit unsigned integer indicating the number of bytes of CliplnfoQ from directly at back of the length field 
20 to the trailing end of the Cliplnfo(). An 8-bit field of Clip_stream_type indicates the type of the AV stream corresponding 
to the Clip Information file, as shown in Fig.47. The stream types of the respective AV streams will be explained sub- 
sequently. 

[0195] The 32-bit field of offset_SPN gives an offset value of the source packet number of the first source packet 
number of the first source packet of the AV stream (Clip AV stream or the Bridge-Clip AV stream). When the AV stream 

25 file is first recorded on the disc, this offset_SPN must be zero. 

[0196] Referring to Fig.48, when the beginning portion of the AV stream file is erased by editing, offset_SPN may 
assume a value other than 0. In the present embodiment, the relative source packet number (relative address) refer- 
encing the offset_SPN is frequently described in. the form of RSPNxxx, where xxx is modified such that RSPN_xxx is 
RAPN_EP__start. The relative source packet number is sized with the source packet number as a unit and is counted 

30 from the first source packet number of the AV stream file with the value of the offset__SPN as the initial value. 

[01 97] The number of source packets from the first source packet of the AV stream file to the source packet referenced 
by the relative source packet number (SPN_xxx) is calculated by the following equation; 



35 SPN_xxx = RSPN_xxx - offset_SPN. 

Fig.48 shows an instance in which offset_SPN is 4. 

[0198] TS_recording_rate is a 24-bit unsigned integer, which affords an input/output bit rate required for the AV 
stream to the DVR drive (write unit 22) or from the DVR drive (readout unit 28). The record_time_and_date is a 56-bit 
40 field for storing the date and time of recording of the AV stream corresponding to the Clip and is an encoded represen- 
tation of year/month/day/hour/minute in 4-bit binary coded decimal (BCD) for 14 numerical figures. For example, 
2001/2/23:01 :02:03 is encoded to "0x2001 122301 0203". 

[0199] The duration is a 24-bit field indicating the total playback time of the Clip by hour/minute/second based on 
arrival time clocks. This field is six numerical figures encoded in 4-bit binary coded decimal (BCD). For example, 01 : 
45 45:30 is encoded to "0x014530". 

[0200] A flag time_controlled_flag indicates the recording mode of an AV stream file. If this time_controlled_flag is 
1 , it indicates that the recording mode is such a mode in which the file size is proportionate to the time elapsed since 
recording, such that the condition shown by the following equation: 

50 

Ts__average_rate*192/1B8*(t - start_time)- a <= size_clip(t) 
<= TS_average„rate* 192/1 88*{t - start_time) + a 

55 where TS_average_rate is an average bit rate of the transport stream of the AV stream file expressed by bytes/second. 
[0201] In the above equation, t denotes the time in seconds, while startjime is the time point when the first source 
packet of the AV stream file was recorded. The size_clip(t) is 10*192 bytes and a is a constant dependent on 
TS_average_rate. 
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[0202] If time_controlledJlag is set to 0, it indicates that the recording mode is not controlling so that the time lapse 
of recording is proportionate to the file size of the AV stream. For example, the Input transport stream is recorded in a 
transparent fashion. 

[0203] If time_controlled Jlag is set to 1 , the 24-bit field of TS_average_rate indicates the value of TS_average_rate 
5 used in the above equation. If time_controlled_flag is set to 0, this field has no meaning and must be set to 0. For 
example, the variable bit rate transport stream is encoded by the following sequence: First, the transport rate is set to 
the value of TS_recording_rate. The video stream is encoded with a variable bit rate. The transport packet is intermit- 
tently encoded by not employing null packets. 

[0204] The 32-bit field of RSPN_arrival_time_discontinuity is a relative address of a site where arrival timebase 
io discontinuities are produced on the Bridge-Clip AV stream file. The RSPN_arrivalJime_discontinuity is sized with the 
source packet number as a unit and is counted with the value of offset_SPN defined in the Cliplnfof) as from the first 
source packet of the Bridge-Clip AV stream file. An absolute address in the Bridge-Clip AV stream file is calculated 
based on the aforementioned equation: 

15 

SPN_xxx = RSPN_xxx - offset.SPN. 



[0205] The 144-bit field of reserverJor_system_use is reserved for a system. If is_format_identifier_valid flag is 1 , 
it indicates that the field of format_identifier is effective. If is_format_identifier_valid flag is 1, it indicates that the 

20 formatjdentifier field is valid. If is_originaLnetworkJD_valid flag is 1, it indicates that the field of 
is_transport_stream_ID-valid is valid. If the flag is_transport_stream_ID-valid is 1, it indicates that the 
transport_stream_ID field is valid. If is_servece_ID_valid flag is 1 , it indicates that the serveceJD field is valid. 
[0206] If is_country code_valid flag is 1, it indicates that the field country_code is valid. The 32-bit field of 
formatjdentifier indicates the value of fonmat_identifier owned by a registration descriptor (defined in ISO/I EC1 381 8-1 ) 

25 in the transport stream. The 1 6-bit field of original_network_ID indicates the value of the original_network_ID defined 
in the transport stream. 

[0207] The 16-bit field in serveceJD denotes the value of serveceJD defined in the transport stream. The 24-bit 
field of country_code shows a country code defined by IS03166. Each character code is encoded by IS08859-1 . For 
example, Japan is represented as "JPN" and is encoded to "0x4A 0x50 0x4E". The stream Jormat_name is 1 5 character 
30 codes of ISO-646 showing the name of a format organization affording stream definitions of transport streams. An 
invalid byte in this field has a value of 'OxFF'. 

[0208] Formatjdentifier, original_networkJD, transport_stream_ID, serveceJD, country_code and 
streamJormat_name indicate service providers of transport streams. This allows to recognize encoding limitations on 
audio or video streams and stream definitions of private data streams other than audio video streams or SI (service 

35 information). These information can be used to check if the decoder is able to decode the stream. If such decoding is 
possible, the information may be used to initialize the decoder system before starting the decoding. 
[0209] STCJnfo is now explained. The time domain in the MPEG-2 transport stream not containing STC discontin- 
uous points(discontinuous points of the system time base) is termed the STC_sequence. In the Clip, STC_sequence 
is specified by the value of STC_sequenceJd. Figs.50A and SOB illustrate a continuous STC domain. The same STC 

40 values never appear in the same STC_sequence, although the maximum tine length of Clip is limited, as explained 
subsequently. Therefore, the same PTS values also never appear in the same STC_sequence. If the AV stream contains 
N STC discontinuous points, where N > 0, the Clip system time base is split into (N+1) STC_sequences. 
[021 0] STCJnfo stores the address of the site where STC discontinuities (system timebase discontinuities) are pro- 
duced. As explained with reference to Fig.51 , the RSPN_STC_start indicates the address and begins at a time point 

45 of arrival of the source packet referenced by the (k+1)st RSPN_STC_start and ends at a time point of arrival of the 
last source packet. 

[021 1] Fig.52 shows the syntax of the STCJnfo. Turning to the syntax of STCJnfo shown in Fig.52, version_number 
is four character letters indicating the version number of STCJnfo(). The version jiumber must be encoded to "0045" 
in accordance with ISO 646. 

so [021 2] Length is a 32-bit unsigned integer indicating the number of bytes of STC Jnfo() from directly after this length 
field to the end of STCJnfo. If CPI Jype of CPI() indicates TU_map type, 0 may be set in this length field. If CPIJype 
of CPI() indicates EP_map type, the num_of_STC_sequence mut be of a value not less than 1 . 
[0213] An 8-bit unsigned integer of num_of_STC_sequence indicates the number of sequences in the Clip. This 
value indicates the number of the for-loops next following the field. The STC_sequenceJd corresponding to the preset 

55 STC_sequence is defined by the order in which appears the RSPN_STC_start corresponding to the STC_sequence 
in the for-loop containing the RSPN_STC_start. The STC_sequenceJd commences at 0. 

[0214] The 32-bit field of RSPN_STC_start indicates an address at which the STC_sequence commences on the 
AV stream file. RSPN_STC_start denotes an address where system time base discontinuities are produced in the AV 
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stream file. The RSPN_STC_start may also be a relative address of the source packet having the first PCR of the new 
system time base in the AV stream. The RSPI\LSTC_start is of a size based on the source packet number and is 
counted from the first source packet of the AV stream file with the offset_SPN value defined in Cliplnfo() as an initial 
value. In this AV stream file, the absolute address is calculated by the above-mentioned equation, that is 

5 

SPN_xxx = RSPN_xxx - offset_SPN. 

[0215] Programlnfo in the syntax of zzzz.clip shown in Fig.45 is now explained with reference to Fig.53. The time 
10 domain having the following features in the Clip is termed program_sequence. These feature are that the value of 
PCR_PID is not changed, the number of audio elementary streams is also not changed, the PID values in the respective 
video streams are not changed, the encoding information which is defined by VideoCodinglnfo thereof is not changed, 
the number of audio elementary streams is also not changed, the PID values of the respective audio streams are not 
changed, and that the encoding information, which is defined by AudioCodinglnfo thereof, is not changed. 
15 [0216] Program_sequence has only one system time base at the same time point. Program_sequence has a sole 
PMT at the same time point. ProgramlnfoQ stores the address of the site where the program_sequence commences. 
RSPN_program_sequence-start indicates the address. 

[0217] Fig. 54 illustrates the syntax of Programlnfo. Turning to the Programlnfo shown in Fig. 54, version_number is 
four character letters indicating the version number of Programlnfo(). The version_number must be encoded to "0045" 

20 in accordance with ISO 646. 

[0218] Length is a 32-bit unsigned integer indicating the number of bytes of Programlnfo() from directly at back of 
this length field to the end of program(info(). If CPI_type of CPI() indicates the TU_map type, this length field may be 
set to 0. If the CPI_type of CPl() indicates EP_map type, the number_of_programs must be of a value not less than 1 . 
[021 9] An 8-bit unsigned integer of number_of_program_sequences denotes the number of program_sequences in 

25 the Clip. This value indicates the number of for-loops next following this field. If program_sequence in the Clip is not 
changed, 1 must be set in the number of program^sequences. A 32-bit field of RSPN_program_sequence_start is a 
relative address where the program sequence commences on the AV stream. 

[0220] RSPN_program__sequence_start is sized with the source packet number as a unit and is counted with the 
value of offset_SPN defined in the Cliplnfo() as from the first source packet of the AV stream file. An absolute address 
30 in the AV stream file is calculated by: 

SPN_xxx = RSPN_xxx - offset_SPN. 

35 The values of RSPN_program_sequence_start in the for-loop syntax must appear in the rising order. 

[0221] A 16-bit field of PCR_PID denotes the PID of the transport packet containing an effective PCR field effective 
for the program_sequence. An 8-bit field of number_of_audios indicates the number of for-loops containing 
audio_stream_PID and AudioCodinglnfo(). A 16-bit field of video_stream_PID indicates the PID of the transport packet 
containing a video stream effective for its program_sequence. VideoCodinglnfo(), next following this field, must explain 

40 the contents of the video stream referenced by its video_stream_PID. 

[0222] A 1 6-bit field of audio_stream_PID indicates the PID of a transport packet containing the audio stream effective 
for its program sequence. The AudioCodinglnfo(), next following this field, must explain the contents of the video stream 
referenced by its audio_stream_PID. 

[0223] The order in which the values of video_stream_PID in the for-loop of the syntax must be equal to the sequence 
45 of PID encoding of the video stream in the PMT effective for the program_sequence. Additionally, the order in which 
the values of audio_stream_PID appears in the for-loop of the syntax must be equal to the sequence of PID encoding 
of the audio stream in the PMT effective for the program_sequence. 

[0224] Fig. 55 shows the syntax of VideoCodinglnfo in the syntax of the Programlnfo shown in Fig.54. Turning to the 
syntax of the VideoCoding Info shown in Fig. 55, an 8-bit field of videojormat indicates the video format corresponding 
50 to video_stream_PID in Programlnfo(), as shown in Fig.56. 

[0225] Referring to Fig.57, an 8-bit field of frame_rate indicates the video frame rate corresponding to the 
video_stream_PID in ProgramlnfoQ. An 8-bit field of display_aspect_ratio indicates a video display aspect ratio corre- 
sponding to video_stream_PID in Programlnfo(). 

[0226] Fig.59 shows the syntax of AudioCodinglnfo in the syntax of Programlnfo shown in Fig.54. Turning to the 
55 syntax of the AudioCoding Info shown in Fig.59, an 8-bit field of audiojonmat indicates the audio encoding method 
corresponding to audio_stream_PID in Program I nfo(), as shown in Fig.60. 

[0227] An 8-bit field of audio_component_type indicates an audio component type corresponding to 
audio_stream_PID in ProgramlnfoQ as shown in Fig.61 , whilst an 8-bit field of sampling ^frequency indicates an audio 



20 



EP 1 198 133 A1 



sampling frequency corresponding to audio_stream_PID In Programing) as shown in Fig. 62. 
[0228] The CPI (Characteristics Point Information) in the syntax of zzzzz.clip shown in Fig.45 is explained. The CPI 
is used for correlating the time information in the AV stream with the address in its file. The CPI is of two types, namely 
EPjnap and TUjnap. In Fig.63, if CPUype in CP\() is EPjnap, its CPI() contains EP_map. In Fig.64, if CPIJype in 
5 CPI() is TILmap, its CPI() contains TU_map. One AV stream has one EPjnap or one TUjnap, If the AV stream is 
an SESF transport stream, the corresponding Clip must own an EPjnap. 

[0229] Fig.65 show the syntax of CPI. Turning to the syntax of CPI shown in Fig.65, the version_number is four 
character letters indicating the version number of this CPI(). The version_number must be encoded to "0045" in ac- 
cordance with ISO 646. Length is a 32-bit unsigned integer indicating the number of bytes as from directly after this 
10 length field to the trailing end of the CPI(). The CPLtype is a 1 -bit flag and indicates the CPI type of Clip, as shown in 
Fig.66. 

[0230] The EPjnap in the CPI syntax shown in Fig.65 is explained. There are two types of the EP_map, that is 
EP_map for a video stream and an EP_map for an audio stream. The EPjnapJype in the EPjnap differentiates 
between these EPjnap types. If the Clip contains one or more video streams, the EPjnap for the video stream must 
15 be used. If the Clip does not contain a video stream but contains one or more audio streams, the EPjnap for the audio 
stream must be used. 

[0231] The EPjnap for a video stream is explained with reference to Fig. 67. The EPjnap for the video stream has 
data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID shows the PID of the transport packet trans- 
mitting a video stream. The PTS_EP_start indicates the PTS of an access unit beginning from the sequence header 
20 of the video stream. The RSPN_RP_start indicates the address of a source packet including the first byte of the access 
unit referenced by the PTS_EP_start in the AV stream. 

[0232] A sub table, termed EP_mapJor„one_stream_PID() is created from one video stream transmitted by the 
transport packet having the same PID to another. If plural video streams exist in the Clip, the EPjnap may contain 
plural EP_map_for_one_stream_PID(). 
25 [0233] The EPjnap for audio stream has data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID 
shows a PID of a transport packet transmitting an audio stream. The PTS_EP_start shows the PTS of an accessing 
unit in the audio stream. The RSPN_EP-start indicates an address of a source packet containing a first byte of the 
access unit referenced by PTS„EP_start of the AV stream. 

[0234] The sub table termed EP_map_for_one„stream_PID() is created from one audio stream transmitted by the 
30 transport packet having the same PID to another. If there exist plural audio streams in the Clip, EPjnap may contain 
plural EP map-_for„one stream_PID(). 

[0235] Turning to the relation between EPjnap and STCJnfo, one EP_map_for„one_stream_PID() is created in 
one table irrespective of discontinuous points in the STC. Comparison of the value of the RSPN_EP_start to the value 
of RSPN_STC_start defined in STCJnfoQ reveals the boundary of data of EPjnap belonging to respective 

35 STC_sequences (see Fig. 68). The EPjnap must have one EP_map_for_one_stream_PID for a continuous stream 
range transmitted by the same PID. In the case shown in Fig. 69, program#1 and program#3 have the same video PID, 
however, the data range is not continuous, so that EP_map_for_one_stream_PID must be provided for each program. 
[0236] Fig.70 shows the EPjnap syntax. Byway of explanation of the EPjnap syntax shown in Fig. 70, the EP_type 
is a 4-brt field and shows the EPjnap entry point type, as shown in Fig.71 . The EPJype shows the semantics of the 

40 data field next following this field. If Clip includes one or more video stream, the EPJype must be set to 0 (Video'). 
Alternatively, if the Clip contains no video stream but contains one or more audio stream, then EPJype must be set 
to 1 ('audio'). 

[0237] The 16-bit field of number_of_stream_PIDs indicates the number of times of loops of the for-loop having 
number_of_stream_PIDs in the EPjnapQ as a variable. The 16-bit field of stream_PID(k) indicates the PID of the 
45 transport packet transmitting the number k elementary stream (video or audio stream) referenced by 
EP_map_for_one_stream_PID (num_EP_entries(k)). If EPJype is 0 ('video'), its elementary stream must be a video 
stream. If EPJype is equal to 1 ('audio'), its elementary stream must be the audio stream. 

[0238] The 16-bit field of num_EP_entries(k) indicates the num_EP_entries(k) referenced by EPjnap_entries(k)). 
The EPjnap Jor_one__stream_PID_ Start_address(k): This 32-bit field indicates the relative address position at which 
50 the EP_mapJor_one_stream_PID(num_EP_entries(k)) begins in the EP_map(). This value is indicated by the size as 
from the first byte of the EP_map(). 

[0239] Padding__word must be inserted in accordance with the EPjnapQ syntax. X and Y must be optional positive 
integers. The respective padding words may assume any optional values. 

[0240] Fig.72 shows the syntax of EP_mapJor_one_stream_PID. By way of explanation of the syntax of the 
55 EP_mapJor_one_stream_PID shown in Fig.72, the semantics of the 32-bit field of PTS_EP_start differs with the 
EPJype defined by EP_map(). If EPJype is equal to 0 (Video'), this field has upper 32 bits of the 33-bit precision PTS 
of the access unit beginning with a sequence header of the video stream. If the EPJype is equal to 1 ('audio'), this 
field has upper 32 bits of PTS of 33 bit precision of the access unit of the audio stream. 
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[0241] The semantics of the 32-bit field of RSPN_EP_start differs with the EP Jype defined in EP_map(). If EP_type 
is equal to 0 (Video'), this field indicates the relative address of the source packet including the first byte of the sequence 
header of the access unit referenced by the PTS_EP_start in the AV stream. Alternatively, if EPJype is equal to 1 
('audio'), this field indicates the relative address of the source packet containing the first byte in the audio stream of 
5 the access unit referenced by the PTS_EP_start in the AV stream. 

[0242] RSPN_EP_start is of a size which is based on the source packet number as a unit, and is counted from the 
first source packet of the AV stream file, with the value of the offset_SPN, defined in Cliplnfo(), as an initial value. The 
absolute address in the AV stream file is calculated by 

10 

SPNjcxx = RSPN_xxx - offset_SPN. 

[0243] It is noted that the value of the RSPN_EP_start in the syntax must appear in the rising order. 
[0244] The TUjnap is now explained with reference to Fig. 73. TlLmap forms a time axis based on the source 
*s packet arrival time clock (timepiece of the arrive time base). This time axis is termed TU_map_time_axis. The point of 
origin of TU_map Jime_axis is indicated by offsetjime in the TU_map(). TU_map_time_axis is divided in a preset unit 
as from offset_time, this unit being termed time_unit. 

[0245] In each time_unit in the AV stream, addresses on the AV stream file of the source packet in the first complete 
form are stored in TU_map. These addresses are termed RSPN_time_unit_start. The time at which begins the k(k > 
20 o)th time_unit on the TU_map_time_axis is termed TU_startJime(k). This value is calculated based on the following 
equation: 

TU_start_time(k) = offsetjime + k*time__unit_size. 

25 

[0246] It is noted that TU_startJime(k) has a precision of 45 kHz. 

[0247] Fig.74 shows the syntax of TU_map. By way of explanation of the TU_map syntax shown in Fig.74, the 32-bit 
field of offsetjime gives an offset time relative to TU_mapJime_axis. This value indicates the offset time relative to 
the first time_unit in the Clip. The offsetjime is of a size based on 45 kHz clock derived from the 27 MHz precision 
30 arrival time clocks as unit. If the AV stream is to be recorded as new Clip, offsetjime must be set to 0. 

[0248] The 32-bit field of time_unit_size affords the size of the time_unit, and is based on 45 kHz clocks, derived 
from the 27 MHz precision arrival time clocks, as unit. Preferably, time_unit_size is not longer than one second 
(time_unit_size < 45000). The 32 bit field of numberjrf Jime_unit_entries indicates the number of entries stored in 
TU_map(). 

35 [0249] The 32-bit field of RSNJime_unit_start indicates the relative address of a site in the AV stream at which 
begins each time_unit. RSN Jimejjnit_start is of a size based on the source packet number as unit and is counted 
with the value of offset_SPN defined in Cliplnfo() as from the first source packet of the AV stream file as an initial value. 
The absolute address in the AV stream file is calculated by 

40 

SPN_xxx = RSPN_xxx - offset.SPN. 

[0250] It is noted that the value of RSN Jime_unit_start in the for-loop of the syntax must appear in the rising order. 
If there is no source packet in the number (k+1) time_unit, the number (k+1) RSNJime_unit_start must be equal to 
45 the number k RSPN Jime_unit_start. 

[0251] By way of explanation of the ClipMark in the syntax of zzzzz.clip shown in Fig.45, the ClipMark is the mark 
information pertinent to clip and is stored in the ClipMark. This mark is not set by a user, but is set by a recorder 
(recording and/or reproducing apparatus 1). 

[0252] Fig.75 shows the ClipMark syntax. By way of explanation of the ClipMark syntax shown in Fig.75, the 
so version_number is four character letters indicating the version number of this ClipMark. The version_number must be 
encoded in accordance with ISO 646 to "0045". 

[0253] Length is a 32-bit unsigned integer indicating the number of bytes of the ClipMark() as from directly after the 
length field to the trailing end of ClipMark(). The number_of_Clip_marks is a 16-bit unsigned integer indicating the 
number of marks stored in ClipMark and may be equal to 0. Mark Jype is an 8-bit field indicating the mark type and is 
55 encoded in accordance with the table shown in Fig. 76. 

[0254] Markjime_stamp is a 32-bit field and stores the time stamp indicating a pointer having a specified mark. The 
semantics of markjime_stamp differs with CP! Jype in the PlayList(), as shown in Fig.77. 

[0255] If CPl.type in CPI() indicates the EP_map type, this 8-bit field indicates the STC_sequenceJd of the contin- 
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uous STC domain where there is placed mark_time_stamp. If CPIJype in CPI() indicates TU_map type, this 8-bit field 
has no meaning but is set to 0. The 8-bit field of Character_set indicates the indicating method of character letters 
encoded in the mark_name field. The encoding method corresponds to the value shown in Fig. 19. 
[0256] The 8-bit field of namejength indicates the byte length of the mark name shown in the mark_name field. This 
s mark_name field indicates the mark name. The byte number corresponding to the number of the namejength from 
left of this field is the effective character number and denotes the mark name. In the mark_name field, the values next 
following these effective character letters may be arbitrary. 

[0257] The field of ref__thumbnail_index indicates the information of the thumbnail picture appended to the mark. If 
the reMhumbnailjndex field is of a value different from OxFFFF, a thumbnail picture is added to its mark, with the 
10 thumbnail picture being stored in the mark.thumb file. This picture is referenced using the value of refjhumbnail_index 
in the mark.thumb file. If the reMhumbnail_index field is of a value equal to OxFFFF, a thumbnail picture is not appended 
to its mark. 

[0258] MakerPrivateData has already been explained with reference to Fig. 22 and hence is not explained here spe- 
cifically. 

13 [0259] Next, thumbnaiUnformation is explained. A thumbnail picture is stored in a menu.thmb file or in a mark.thmb 
file. These files are of the same syntax structure and own a sole Thumbnail(). The menu.thmb file stores a picture 
representing respective PlatyLists, The totality of menu thumbnails are stored in the sole menu.thmb file. 
[0260] The mark.thmb file stores a mark thumbnail picture, that is a picture representing a mark point. The totality 
of mark thumbnails corresponding to the totality of PlayLists and Clips are stored in the sole mark.thmb file. Since the 

20 thumbnails are frequently added or deleted, the operation of addition and partial deletion must be executable readily 
and speedily. For this reason, Thmbnail() has a block structure. Picture data is divided into plural portions each of which 
is stored in one tn_block. One picture data is stored in consecutive tn_blocks. In the string of tn_blocks, there may 
exist a tn_block not in use. The byte length of a sole thumbnail picture is variable. 

[0261] Fig. 78 shows the syntax of menu.thmb and mark.thmb and Fig. 79 the syntax of Thumbnail in the syntax of 
25 menu.thmb and mark.thmb shown in Fig. 78. By way of explanation of the syntax of Thumbnail, shown in Fig.79, 
version_number is four character letters denoting the version number of this ThumbnailQ. Version_number must be 
encoded to "0045" in accordance with ISO 646. 

[0262] Length is a 32-bit unsigned integer indicating the number of bytes of MakerPrivateData() as from directly at 
back of the length field up to the trailing end of Thumbnail(). Tu_block_start_address is a 32-bit unsigned integer 

30 indicating the leading end byte address of the first tn_block, in terms of the relative number of bytes as from the leading 
end byte of Thumbnail() as a unit. The number of relative bytes is counted from 0. Number_of_thumbnails is a 16-bit 
unsigned integer which gives the number of entries of a thumbnail picture contained in Thumbnail{). 
[0263] Tu_block_size is a 16-bit unsigned integer which gives the size of one tnjslock, in terms of 1024 bytes as a 
unit. If, for example, tn_block_size = 1 , it indicates that the size of one tn_block is 1024 bytes. Number_of_tn_blocks 

35 is a 1 1 6-bit unsigned integer indicating the number of entries of tn_block in this Thumbnail. ThumbnaiM ndex is a 1 6-bit 
unsigned integer indicating the index number of the thumbnail picture represented by the thumbnail information for 
one for-loop beginning from the thumbnail_index field. The value OxFFFF must not be used as Thumbnail_index. This 
ThumbnaiUndex is referenced by ref_th urn bnailj ndex in UiApplnfoVolume(), UIApplnfoPlayList(), PlayListMark() and 
ClipMarkQ. 

40 [0264] ThumbnaiLpictureJormat is an 8-bit unsigned integer representing the picture format of a thumbnail picture 
and assumes a value shown in Fig. 80. In the table, DCF and PNG are allowed only in menu.thumb. The mark thumbnail 
must assume the value of "0x00" (MPEG-2 Video 1 -picture). 

[0265] Picture_data_size is a 32-bit unsigned integer indicating the byte length of a thumbnail picture in terms of 
bytes as a unit. Start_tn_block_n umber is a 16-bit unsigned integer indicating the tn_block number of the tn_block 
45 where data of the thumbnail picture begins. The leading end of the thumbnail picture data must coincide with the leading 
end of the tn _btock. The tn_block number begins from 0 and is relevant to the value of a variable k in the for-loop of 
tn_block. 

[0266] X_picture_length is a 16-bit unsigned integer indicating the number of pixels in the horizontal direction of a 
frame of a thumbnail picture. Y_picturejength is a 1 6-bit unsigned integer indicating the number of pixels in the vertical 

so direction of a frame of a thumbnail picture. Tn_block is an area in which to store a thumbnail picture. All tn_block in 
the ThumbnailQ are of the same size (fixed length) and are of a size defined by tn_block_sixe. 
[0267] Figs. 81 A and 81 B schematically show how thumbnail picture data are stored in tn_block. If, as shown in Figs. 
81 A and 81 B, the thumbnail picture begins at the leading end of tn_block, and is of a size exceeding 1 tn_block, it is 
stored using the next following tn_block. By so doing, data with a variable length can be managed as fixed length data, 

55 so that the editing of deletion can be coped with by simpler processing. 

[0268] An AV stream file is now explained. The AV stream file is stored in the "M2TS a directory (Fig. 14). There are 
two types of the AV stream file, namely a Clip A stream file and a Bridge-Clip AV stream file. Both AV streams must 
be of the structure of DVR MPEG-2 transport stream file as hereinafter defined. 
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[0269] First, the DVR MPEG2 transport stream is explained. The structure of the DVR MPEG-2 transport stream is 
shown in Fig.82. The AV stream file has the structure of a DVR MPEG 2 transport stream. The DVR MPEG 2 transport 
stream is made up of an integer number of Aligned units. The size of the aligned unit is 6144 bytes (2048*3 bytes). 
The Aligned unit begins from the first byte of the source packet. The source packet is 192 bytes long. One source 
5 packet is comprised of TP_extra_header and a transport packet. TP_extra_header is 4 bytes long, with the transport 
packet being 188 bytes long. 

[0270] One Aligned unit is made up of 32 source packets. The last Aligned unit in the DVR MPEG 2 transport stream 
is also made up of 32 source packets. Therefore, the DVR MPEG 2 transport stream ends at a boundary of the Aligned 
unit. If the number of the transport packets of the input transport stream recorded on a disc is not a multiple of 32, a 
10 source packet having a null packet (transport packet of PID = 0x1 FFFF) must be used as the last Aligned unit. The file 
system must not use excess information in the DVR MPEG 2 transport stream. 

[0271] Fig. 83 shows a recorder model of the DVR MPEG 2 transport stream. The recorder shown in Fig. 83 is a 
conceptual model for prescribing the recording process. The DVR MPEG 2 transport stream obeys this model. 
[0272] The input timing of the MPEG 2 transport stream is now explained. The input MPEG 2 transport stream is a 
15 full transport stream or a partial transport stream. The input MPEG 2 transport stream must obey the ISO/IEC1 381 8-1 
or ISO/IEC 13818-9. The number i byte of the MPEG 2 transport stream is input simultaneously at time t(i) to T-STD 
(transport stream system target decoder provided for in SO/IEC13818-1) and to the source packetizer. Rpk is an in- 
stantaneous maximum value of the input rate of the transport packet. 

[0273] A 27 MHz PLL 52 generates a frequency of 27 MHz clock. The 27 MHz clock frequency is locked at a value 
20 of the program clock reference (PCR) of the MPEG 2 transport stream. An arrival time clock counter 53 counts the 
pulses of the 27 MHz frequency. Arrival_time_clock(i) is a count value of the arrival time clock counter at time t(i). 
[0274] A source packetizer 54 appends TP_extra_header to the totality of the transport packets to create a source 
packet. Arrival_time_stamp indicates the time when the first byte of the transport packet reaches both the T-STD and 
the source packetizer. Arrival_time_stamp(k) is a sampled value of the Arrival_time_clock(k) as represented by the 
25 following equation: 

arrival_time_stamp(k) = arrival_time_clock(k)% 230 

30 where k denotes the first byte of the transport packet. 

[0275] If the time separation between two neighboring transport packets is 230/2 7000000 sec (about 40 sec) or 
longer, the difference of the arrival_time_stamp of the two transport packets should be set to 230/2 7000000 sec. The 
recorder is provided for such case. 

[0276] A smoothing buffer 55 smoothes the bitrate of the input transport stream. The smoothing buffer must not 
35 overflow. Rmax is the output bitrate of the source packet from the smoothing buffer when the smoothing buffer is not 
null. If the smoothing buffer is null, the output bitrate of the smoothing buffer is 0. 

[0277] Next, the parameters of the recorder model of the DVR MPEG 2 transport stream are explained. The value 
of Rmax is given by TS_recording_rate as defined in CliplnfoQ associated with the AV stream file. This value may be 
calculated from the following equation: 

40 

Rmax = TS_recording_rate* 192/1 88 

where the value of TS_recording_rate is of a size in bytes/second. 
45 [0278] If the input transport stream is an SESF transport stream, Rpk must be equal to TS_recording_rate as defined 
in Cliplnfo() associated with the AV stream file. If the input transport stream is not an SESF transport stream, reference 
may be made to values defined e.g., in a descriptor of the MPEG 2 transport stream, such as 
maximum_bitrate_descriptor or partial_stream_descriptor for this value. 

[0279] If the input transport stream is an SESF transport stream, the smoothing buffer size is 0. If the input transport 
so stream is not an SESF transport stream, reference may be made to values defined in the descriptor of the MPEG 2 
transport stream, such as, for example, the values defined in the smooth ing_buffer_descriptor, short_smoothing 
_buffer_descriptor or in the partial_transport_stream_descriptor. 

[0280] For the recorder and the player (reproducing apparatus), a sufficient size buffer needs to be provided. The 
default buffer size is 1536 bytes. 
55 [0281] Next, a player model of the DVR MPEG 2 transport stream is explained. Fig.84 shows a player model of the 
DVR MPEG 2 transport stream. This is a conceptual model for prescribing the reproduction process. The DVR MPEG 
2 transport stream obeys this model. 

[0282] A 27 MHz X-tal 61 generates the frequency of 27 MHz. An error range of the 27MHx frequency must be ±30 
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ppm (2 7000000 ±810 Hz). The arrival time clock counter 62 is a binary counter for counting the pulses of the frequency 
of 27 MHz. Arrival_time_clock{i) is a count value of the arrival time clock counter at time t(i). 
[0283] In the smoothing buffer 64, Rmax is the input bitrate of the source packet to the smoothing buffer when the 
smoothing buffer is not full. If the smoothing buffer is full, the input bitrate to the smoothing buffer is 0. 
5 [0284] By way of explaining the output timing of the MPEG 2 transport stream, if the arrivaljime_stamp of the current 
source packet is equal to 30 bits on the LSB side of arrival_time_clock(i), the transport packet of the source packet is 
removed from the smoothing buffer Rpk is an instantaneous maximum value of the transport packet rate. The overflow 
of the smoothing buffer is not allowed. 

[0285] The parameters of the player model of the DVR MPEG 2 transport stream are the same as those of the 

10 recorder model of the DVR MPEG 2 transport stream described above. 

[0286] Fig. 85 shows the syntax of the source packet. Trans port _packet() is an MPEG 2 transport stream provided 
in ISO/IEC 13818-1. The syntax of TP Extra-header in the syntax of the source packet shown in Fig. 85 is shown in 
Fig. 86. By way of explaining the syntax of the TP_Extra- header, shown in Fig. 86, copy_permissionjndicator is an 
integer representing the copying limitation of the payload of the transport packet. The copying limitation may be copy 

'5 free, no more copy, copy once or copying prohibited. Fig.87 shows the relation between the value of 
copy_permissionjndicator and the mode it designates. 

[0287] Copyjsermissionjndicator is appended to the totality of transport packets. If the input transport stream is 
recorded using the IEEE1394 digital interface, the value of copy_permission_indicator may be associated with the 
value of EMI (encryption mode indicator). If the input transport stream is recorded without employing the IEEE1394 
20 digital interface, the value of copy_permission_indicator may be associated with the value of the CCI embedded in the 
transport packet. If an analog signal input is self-encoded, the value of copy_permission_indicator may be associated 
with the value of CGMS-A of the analog signal. 

[0288] Arrival_time_stamp is an integer having a value as specified by arrival_time_stamp in the following equation: 

25 

arrival_time_stamp(k) = arrival_time_clock(k)%230. 

[0289] By way of defining the ClipAV stream, the ClipAV stream must have a structure of the DVR MPEG 2 transport 
stream defined as described above. Arrival_time_clock(i) must increase continuously in the Clip AV stream. If there 
30 exists a discontinuous point of the system time base (STC base) in the Clip AV stream, arrival_time_clock(i) in the Clip 
AV stream must increase continuously. 

[0290] The maximum value of the different of the amval_time_clock(i) between the beginning and the end of the Clip 
AV stream must be 26 hours. This limitation guarantees that, if there is no discontinuous point in the system time base 
(STC base) in the MPEG 2 transport stream, the PTS (presentation time stamp) of the same value never appears in 
35 the Clip AV stream. The MPEG 2 system standard provides that the PTS has a wraparound period of 233/90000 sec 
(about 26.5 hours). 

[0291] By way of defining the Bridge-Clip AV stream, the Bridge-Clip AV stream must have a structure of the DVR 
MPEG 2 transport stream defined as described above. The Bridge-Clip AV stream must include a discontinuous point 
of one arrival time base. The transport stream ahead and at back of the discontinuous point of the arrival time base 

40 must obey the encoding limitations and the DVR-STD as later explained. 

[0292] The present embodiment supports the video-audio seamless connection between Playltems being edited. 
Seamless connection between Playltems guarantees "continuous data supply" to the player/decoder and "seamless 
decoding processing". The "continuous data supply" is the capability of guaranteeing data supply to the decoder at a 
bitrate necessary to prevent buffer underflow. In order to enable data to be read out from the disc as data real-time 

45 properties are assured, data is to be stored in terms of a continuous block of a sufficiently large size as a unit. 

[0293] The "seamless decoding processing" means the capability of a player in displaying audio video data recorded 
on the disc without producing pause or gap in the playback output of the decoder. 

[0294] The AV stream, referenced by the seamless connected Playltems, is explained. Whether or not the seamless 
display of a previous Playltem and the current Playltem is guaranteed may be verified from the connection_condition 
so field defined in the current Playltem. There are two methods for seamless connection of Playltems, that is a method 
employing Bridge-Clip and a method not employing Bridge-Clip. 

[0295] Fig. 88 shows the relation between the previous Playltem and the current Playltem in case of employing 
Bridge-Clip. In Fig. 88, the stream data, read out by the player, is shown shaded. In Fig. 88, TS1 is made up of shaded 
stream data of the Clipl (Clip AV stream) and shaded stream data previous to RSPN_arrival_time_discontinuity. 
55 [0296] The shaded stream data of Clipl of TS1 is stream data from an address of a stream required for decoding 
the presentation unit corresponding to IN_item of the previous Playltem (shown as IN-time1 in Fig.88) up to the source 
packet referenced by RSPN_exit_from_previous_Clip. The shaded stream data prior to 
RSPN_arriva!_time_discontinuity of Bridge-Clip contained in TS1 is stream data as from the first source packet of 
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Bridge-Clip up to the source packet directly previous to the source packet referenced by 
RSPN_arrivaLtime_discontinuity. 

[0297] In Fig. 88, TS2 is made up of shaded stream data of Clip 2 (Clip AV stream) and shaded stream data subse- 
quent to RSPN_am'valJime_discontinuity of Bridge-Clip. The shaded stream data as from the 

5 RSPN_arrival_time_discontinuity of Bridge-Clip contained in TS2 stream data from the source packet referenced by 
RSPN_arriva1_time_discontinuity to the last source packet of Bridge-Clip. The shaded stream data of Clip2 of TS2 is 
stream data from the source packet referenced by RSPN_enter_to_current_Clip to the address of the stream required 
for decoding the presentation unit corresponding to OUTJime of current Playltem (shown by OUT_time2 in Fig.88). 
[0298] Fig. 89 shows the relation between the previous Playltem and the current Playltem in case of not employing 

10 Bridge-Clip. In this case, the stream data read out by the player is shown shaded. In Fig.89.TS1 is made up of shaded 
stream data of the Clipl (Clip AV stream). The shaded stream data of Clipl of TS1 is data beginning at an address of 
a stream necessary in decoding a presentation unit corresponding to INJime of the previous Playltem, shown at 
INjimel in Fig. 89 as far as the last source packet of Clipl . 
[0299] In Fig.89, TS2 is shaded stream data of Clip2 (Clip AV stream). 

15 [0300] The shaded stream data of Clip2 of TS2 is stream data beginning at a first source packet of Clip2 as far as 
an address of the stream necessary for decoding the presentation unit corresponding to OUTJime of current Playltem 
(shown at OUT_time2 in Fig.89). 

[0301] In Figs. 88 and 89, TS1 and T2 are continuous streams of the source packet. Next, the stream provisions of 
TS1 and TS2 and the connection conditions therebetween are scrutinized. First, encoding limitations for seamless 

20 connection are scrutinized. By way of limitations on the encoding structure of a transport stream, the number of pro- 
grams contained in TS1 and TS2 must be 1 . The number of video streams contained in TS1 and TS2 must be 1 . The 
number of audio streams contained in TS and TS2 must be 2 or less. The numbers of the audio streams contained in 
TS1 and TS2 must be equal to each other. It is also possible for elementary streams or private streams other than 
those depicted above to be contained in TS1 and/or TS2. 

25 [0302] The limitations on the video bitstream are now explained. Fig. 90 shows atypical seamless connection indi- 
cated by a picture display sequence. In order for a video stream to be demonstrated seamlessly in the vicinity of a 
junction point, unneeded pictures displayed at back of OUT_time1 (OUTJime of Clipl ) and ahead of IN Jime2 (INJime 
of Clip2) must be removed by a process of re-encoding the partial stream of the Clip in the vicinity of the junction point. 
[0303] Fig. 91 shows an embodiment of realizing seamless connection using BridgeSequence. The video stream of 

30 Bridge-Clip previous to RSPN_arrivalJimejJiscontinuity is comprised of an encoded video stream up to a picture 
corresponding to OUTJimel of Clipl of Fig. 90. This video stream is connected to the video stream of previous Clipl 
and is re-encoded to form an elementary stream conforming to the MPEG2 standard. 

[0304] The video stream of Bridge-Clip subsequent to RSPN_arrivalJime_discontinuity is made up of an encoded 
video stream subsequent to a picture corresponding to IN Jime2 of Clip2 of Fig. 90. The decoding of this video stream 

35 can be started correctly for connecting the video stream to the next following Clip2 video stream. Re-encoding is made 
such that a sole continuous elementary stream conforming to M PEG 2 standard will be formed. For creating Bridge-Clip, 
several pictures in general need to be re-encoded, whilst other pictures can be copied from the original Clip. 
[0305] Fig. 92 shows an embodiment of realizing seamless connection without employing BridgeSequence in the 
embodiment shown in Fig. 90. The Clipl video stream is comprised of an encoded video stream as far as the picture 

40 corresponding to OUTJimel of Fig.90 and is re-encoded so as to give an elementary stream conforming to the MPEG2 
standard. In similar manner, the video stream of Clip2 is made up of encoded bitstreams subsequent to the picture 
associated with INJime2 of Clip2 of Fig.90. These encoding bitstreams are already re-encoded to give a sole contin- 
uous elementary stream conforming to the MPEG2 standard. 

[0306] By way of explaining encoding limitations of the video stream, the frame rates of the video streams of TS1 
45 and TS2 must be equal to each other. The video stream of TS1 must be terminated at sequence_end_code. The video 
stream of TS2 must commence at Sequence header, GOP Header and with an l-picture. The video stream of TS2 
must commence at a closed GOP. 

[0307] The video presentation units defined in a bitstream (frame or field) must be continuous with a junction point 
in-between. No gap of the fields or frames are allowed to exist at junction points. In case of encoding employing 3-2 
50 pulldown, it may be necessary to rewrite "topjield Jirst" and "repeat Jirst Jield" flags. Alternatively, local re-encoding 
may be made to prevent field gaps from being produced. 

[0308] By way of explaining encoding limitations on the audio bitstream, the audio sampling frequency of TS1 and 
that of TS2 must be equal to each other. The audio encoding method of TS1 and that of TS2 (for example, MPEG1 
layer 2, AC-3, SESF LPCM and AAC) must be equal to each other. 
55 [0309] By way of explaining encoding limitations on MPEG-2 transport stream, the last audio frame of the audio 
stream of TS1 must contain audio samples having a display timing equal to the display end time of the last display 
picture of TS1 . The first audio frame of the audio stream of TS2 must contain an audio sample having a display timing 
equal to the display start timing of the first display picture of TS2. 
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[0310] At a junction point, no gap may be allowed to exist in a sequence of the audio presentation units. As shown 
in Fig.93, there may be an overlap defined by the length of the audio presentation unit less than two audio frame 
domains. The first packet transmitting an elementary stream of TS2 must be a video packet. The transport stream at 
the junction point must obey the DVR-STD which will be explained subsequently. 
5 [0311] By way of explaining limitations on the Clip and Bridge-Clip, no discontinuities in the arrival time base are 
allowed to exist in TS1 or in TS2. 

[0312] The following limitations are applied only to the case of employing the Bridge-Clip. The Bridge-Clip AV stream 
has a sole discontinuous point in the arrival time base only at a junction point of the last source packet of TS 1 and the 
first source packet of TS2. The SPN_arrival_time_discontinuity defined in Cliplnfo() represents an address of the dis- 

10 continuous point, which must represent the address referencing the first source packet of TS2. 

[0313] The source packet referenced by RSPN_exit_from_previous_Clip defined in BridgeSequencelnfo() may be 
any source packet in Clipl . It is unnecessary for this source packet to be a boundary of the Aligned unit. The source 
packet referenced by RSPr\LenterJo_current_Clip defined in BridgeSequencelnfo() may be any source packet in 
Clip2. It is unnecessary for this source packet to be a boundary of the Aligned unit. 

15 [0314] By way of explaining limitations on Playltem, the OUTJime of the previous Play Item (OUTjime 1 shown in 
Fig. 89) must represent the display end time of the last video presentation unit of TS1 . The INJime of the current 
PlayTime (IN_time2 shown in Fig. 88 and 89) must represent the display start time of the first presentation unit of TS2. 
[0315] By way of explaining the limitations on the data allocation in case of employing Bridge-Clip by referring to Fig. 
94, the seamless connection must be made to guarantee continuous data supply by the file system. This must be 

20 realized by arranging the Bridge-Clip AV stream, connecting to Clipl (Clip AV stream file) and Clip2 (Clip AV stream 
file), such as to satisfy data allocation prescriptions. 

[0316] RSPN_exit_from_previous_C!ip must be selected so that the stream portion of Clipl (Clip AV stream file) 
previous to RSPN_exitJrom_previous_Clip will be arranged in a continuous area not less than half fragment. The data 
length of the Bridge-Clip AV stream must be selected so that the data will be arranged in the continuous area not less 

25 than half fragment. RSPN__enterJo_current_Clip must be selected so that the stream portion of Clip2 (Clip AV stream 
file) subsequent to RSPN_enter__to__current_Clip will be arranged in a continuous area not less than half fragment. 
[0317] By way of explaining data allocation limitations in case of seamless connection not employing Bridge-Clip, 
by referring to Fig. 95, the seamless connection must be made so as to guarantee continuous data supply by the file 
system. This must be realized by arranging the last portion of the Clipl (Clip AV stream file) and the first portion of the 

30 Clip2 (Clip AV stream file) so that the provisions on data allocation will be met. 

[0318] The last stream portion of Clipl (Clip AV stream file) must be arranged in a continuous area not less than one 
half fragment. The first stream portion of Clip2 (Clip AV stream file) must be arranged in a continuous area not less 
than one half fragment. 

[0319] In a case where digital AV signals each having a predetermined bit rate are fragmented and recorded on a 
35 disc, to assure that the recorded digital AV signals can be read out from the recording medium 1 00 in the predetermined 
bit rate, the size of one continuous recording area must satisfy the following condition: 

S*8 / (S*8/Rud + Ts) >= Rmax 

40 

where 

S: minimum size of one continuous recording area [Byte] 

Ts: access time of full stroke from one recording area to next recording area [second] 
45 Rud: bit rate for reading out from recording medium [bit/second] 

Rmax: bit rate of AV stream [bit/second], 

[0320] That is, the data must be arranged so that S byte or more data in the AV stream may be recorded continuously 
on the disc. 

so [0321] The data must be arranged so that the size of the half fragment may be S byte or more. 

[0322] Next, DVR-STD is explained. This DVR-STD is a conceptual model for modeling the decoding processing in 
the generation and verification of the DVR MPEG 2 transport stream. The DVR-STD is also a conceptual model for 
modeling the decoding processing in the generation and verification of the AV stream referenced by two Playltems 
seamlessly connected to each other as described above. 

55 [0323] Fig.96 shows a DVR-STD model. The model shown in Fig. 96 includes, as a constituent element, a DVR 
MPEG 2 transport stream player model. The notation of n, Tbn, Mbn, Ebn, Tbsys, Bsys, Rxn, Rbxn, Rxsys, Dn, Dsys, 
On and P9(k) is the same as that defined in T-STD of ISO/I EC 13818-1 , wherein n is an index number of an elementary 
stream and TBn is a transport buffer of the elementary stream n. 
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[0324] MBn is a multiplexing buffer of the elementary stream n and exists only for the video stream. EBn is an 
elementary stream buffer of the elementary stream n and is present only for the video stream. TBsys is a main buffer 
in a system target decoder for the system information for a program being decoded. Rxn is a transmission rate with 
which data is removed from TBn. Rbxn is a transmission rate with which the PES packet payload is removed from MBn 
s and is present only for a video stream. 

[0325] Rxsys is a transmission rate with which data is removed from TBsys. Dn is a decoder of the elementary stream 
n. Dsys is a decoder pertinent to the system information of a program being decoded. On is a re-ordering buffer of the 
video stream n. Pn(k) is a number k presentation unit of the elementary stream. 

[0326] The decoding process for DVR-STD is explained. During the time a sole DVR MPEG 2 transport stream is 
10 being reproduced, the timing of inputting the transport packet to TB1 , TBn or TBsys is determined by arrival_time_stamp 
of the source packet. The prescriptions for the buffering operation of TB1 , MB1 , EB1 , TBn Bn, TBsys and Bsys are the 
same as those of the T-STD provided for in ISO/IEC 13818-1 , while the prescriptions for the deciding and display 
operations are also the same as the T-STD provided for in ISO/IEC 1 381 B-1 . 

[0327] The decoding process during the time the seamlessly connected PlayLists are being reproduced is now ex- 
's plained. Here, the reproduction of two AV streams referenced by the seamlessly connected Playltems is explained. In 
the following explanation, the reproduction of TS1 and TS2, shown for example in Fig. 88, is explained. TS1 and TS2 
are a previous stream and a current stream, respectively. 

[0328] Fig. 97 shows a timing chart for inputting, decoding and display of transport packets when transferring from 
a given AV stream (TS1 ) to the next AV stream seamlessly connected thereto (TS2). During transfer from a preset AV 

20 stream (TS1 ) to the next AV stream seamlessly connected thereto (TS2), the time axis of the arrival time base of TS2 
is not the same as the time axis of the arrival time base of TS1 (indicated by ATC1 in Fig. 97). 
[0329] Moreover, the time axis of the system time base of TS2 (indicated by ATC1 in Fig. 97) is not the same as the 
time axis of the system time base of TS1 (indicated by STC1 in Fig. 97). The video display is required to be continuous 
seamlessly, however, there may be overlap in the display tine of the presentation units. 

25 [0330] The input timing to DVR-STD is explained. During the time until time T1 , that is until the inputting of the last 
video packet to the TB1 of DVR-STD, the input timing to the buffers of TB1 , TBn or TBsys of DVR-STD is determined 
by arrival_time_stamp of the arrival time base of TS1 . 

[0331] The remaining packets of TS1 must be input to buffers of TBn or to TBsys of DVR-STD at a bitrate of 
TS_recording_rate (TS1). The TS_recording_rate (TS1) is the value of TS„recording_rate defined in Cliplnfo() corre- 
30 sponding to Clipl . The time the last byte of TS1 is input to the buffer is the time T2. So, during the time between time 
T1 and time T2, arrivaL_time_stamp of the source packet is discounted. 

[0332] If N1 is the number of bytes of the transport packet of TS1 next following the last video packet of TS1 , the 
time DT1 from time T1 until time T2 is the time necessary for N1 bytes to be input completely at a bitrate of 
TS_recording_rate (TS1), and is calculated in accordance with the following equation: 

35 

DT1 = T2 - T1 = N1/TS_recording_rate. 

[0333] During the time from time T1 until timeT2 (TS1), both the values of RXn and RXsys are changed to the value 
40 of TS-recording_rate (TS1). Except this rule : the buffering operation is the same as that of T-STD. 

[0334] At time T2, the arrival time clock counter is reset to the value of arrivaLtime_stamp 0 f the first source packet 
of TS2. The input timing to the buffer of TB1 , TBn or TBsys of DVR-STD is determined by arrival_time_stamp of the 
source packet of TB2. Both RXn and RXsys are changed to values defined in T-STD. 

[0335] By way of explaining additional audio buffering and system data buffering, the audio decoder and the system 
45 decoder need to have an additional buffering amount (data amount equivalent to one second) in addition to the buffer 
amount defined in T-STD in order to allow input data of a domain from time T1 to time T2. 

[0336] By way of explaining the video presentation timing, the display on the video presentation unit must be con- 
tinuous, that is devoid of gaps, through junction point. It is noted that STC1 is the time axis of the system time base of 
TS1 (indicated as STC1 in Fig.9), while STC2 is the time axis of the system time base of TS2 (shown at STC2 in Fig. 
so 97; correctly, STC2 begins at time the first PCR of TS2 has been input to the T-STD). 

[0337] The offset between STC1 and STC2 is determined as follows: the PTSIend is the PTS on STC1 corresponding 
to the last video presentation unit of TS2. PTS2start is PTS on STC2 corresponding to the first video presentation unit 
of TS2 and Tpp is the display time period of the last video presentation unit of TS1 , the offset STC_delta between two 
system time bases is calculated in accordance with the following equation: 

55 

STC_detta = PTSIend + Tpp - PTS2start. 



28 



EP1 198 133 A1 

[0338] By way of explanation of the audio presentation timing, there may be overlap in the display timing of the audio 
presentation unit, with the overlap being less than 0 to 2 audio frames (see "audio overlap" shown in Fig. 97). The 
indication as to which of audio samples is to be selected and re-synchronization of the display of the audio presentation 
unit to the corrected time base at back of the junction point are set on the player. 
5 [0339] By way of explaining the system time clock of DVR-STD, the last audio presentation unit of TS1 is displayed 
at time T5. The system time clock may be overlapped between time T2 and time T5. During this time domain, the 
DVR-STD switches the system time clocks between the value of the old time base (STC1) and the value of the new 
time base (STC2). The value of STC2 may be calculated in accordance with the following equation: 

10 

STC2 = STC1-STC_delta. 

[0340] The buffering continuity is explained. STC11video_end is the value of STC on the system time base STC2 
when the first byte of the first video packet reaches TB1 of DVR-STD. STC22video_start is the value of STC on the 
15 system time base STC2 when the first byte of the first video packet reaches TB1 of DVR-STD. STC21 video_end is 
the value of STC11video_end calculated as the value on STC2 of the system time base STC2. STC21 video_end is 
calculated in accordance with the following equation: 

20 STC21 video_end = STC1 1video_end - STC_delta. 

[0341] In order to obey DVR-STD, the following two conditions must be met: First, the arrival timing of the first video 
packet of TS2 at TB1 must satisfy the following inequality: 

25 

STC22video_start > STC21 video_end + AT1 . 

[0342] If it is necessary to re-encode and/or multiplex the partial stream of Clipl and/or Clip2, in such a manner that 
the above inequality will be satisfied, this re-encoding or multiplexing is performed as appropriate. 
30 [0343] Second, the inputting of the video packet from TS1 followed by the inputting of the video packet from TS2 on 
the time axis of the system time base, mapped from STC1 and STC2 on the same time axis, must not overflow or 
underflow the video buffer. 

[0344] If the above syntax, data structure and the rules are used as basis, the contents of data recorded on the 
recording medium or the reproduction information can be managed properly to enable the user to confirm the contents 
35 of data recorded on the recording medium at the time of reproduction or to reproduce desired data extremely readily, 
[0345] In the above-described embodiment, the MPEG 2 transport stream is taken as an example of the multiplexed 
stream. This, however, is merely exemplary, such that the MPEG 2 program stream DSS or the transport stream used 
in the DirecTV Service (trade mark) of USA may also be used as the multiplexed stream. 

[0346] Fig. 98 shows a modification of BridgeSequencelnfoQ. The difference from the BridgeSequencelnfo() of Fig. 

40 38 is that it contains only Bridge_ClipJnformation_file_name. 

[0347] Fig. 99 illustrates Bridge_Clip when two Playltems are seamlessly connected in case of using the syntax of 
BridgeSequencelnfo() of Fig. 98. RSPN_exit_from_previous_Clip is the source packet number of the source packet on 
the Clip AV stream referenced by the previous Playltem. Next to this source packet is connected the first source packet 
of the Bridge_Clip AV stream file. 

45 [0348] RSPN_enter_to_current_Clip is the number of the source packet on the Clip AV stream referenced by the 
current Playltem. Ahead of this source packet is connected the last source packet of the Bridge-Clip AV stream file 
referenced by the current Playltem. In the Bridge-Clip AV stream, shown in Fig. 99, SPN_ATC_start indicates the source 
packet number of the source packet in the Bridge-Clip AV stream file at which commences the time axis of new arrival 
time base. 

so [0349] The Bridge-Clip AV stream file has a sole arrival time base discontinuous point. The second SPN_ATC_start 
in the drawings has the same meaning as that of the RSPN_arrival_time_discontinuity in Fig.37. 
[0350] If the syntax of BridgeSequencelnfo() of Fig.98 is used, RSPN_exit_from_previous_Clip and 
RSPN_enter_to_current_C!ip are stored in the Clip Information file corresponding to the Bridge-Clip AV stream file. 
SPN_ATC_start is also stored in the Clip Information file. 

55 [0351] Fig. 100 shows the syntax of the Clip Information file when the BridgeSequencelnfo is of the syntax of Fig.98. 
Sequencelnfo_start_address indicates the leading address of Sequencelnfo(), in terms of the number of relative bytes 
from the leading byte of the Clip Information file as a unit. The number of relative bytes is counted from 0. 
[0352] Fig. 1 01 shows the syntax of CliplnfoQ of the Clip Information file of Fig. 1 00. Clip_stream_type indicates wheth- 
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er the AV stream file of the Clip is the Clip AV stream file or the Bridge-Clip AV stream file. If the Clip_stream Jype 
indicates the Bridge-Clip AV stream file, the next syntax field follows, 

[0353] Previous_ClipJnformation_file_name indicates the Clip Information file name connected ahead of the 
Bridge-Clip AV stream file. RSPN_exitJrom_previous_Clip is the source packet number of the source packet on the 
5 Clip AV stream file indicated by previous_Clip_lnformation_file_name. Next to this source packet is connected the first 
source packet of the Bridge-Clip AV stream file. This source packet number is counted as from the first source packet 
of the Clip AV stream file, with 0 as an initial value. 

[0354] Current_Clip_lnformationJile_name indicates the Clip Information file name of the Clip connected at back of 
the Bridge-Clip AV stream file. RSPN_enter_to_current_Clip is the source packet number of the source packet on the 
10 Clip AV stream file indicated by the current_Clip_lnformation_file_name. Ahead of this source packet is connected the 
last source packet of the Bridge-Clip AV stream file. This source packet number is the is counted as from the first 
source packet of the Clip with the AV stream file with 0 as an initial value. 

[0355] Fig. 102 indicates the syntax of the SequencelnfoQ of the Clip Information file of Fig. 100. 
NunrL_of_ATC_sequence indicates the number of the ATC_sequence in the AV stream file. ATC_sequence is a source 
'5 packet sequence not containing discontinuous points of the arrival time base. In the case of Bridge-Clip, this value is 2. 
[0356] SPN_ATC_start[atcJd] indicates an address at which commences the arrival time base indicated by atc_id 
on the AV stream file. SPN_ATC_start[atc_id] is of a size based on the source packet number as unit and is counted 
as from the first source packet of the AV stream file with 0 as an initial value. 

[0357] Hgs.103A and 103B illustrate database change in case stream data of the Clip AV stream file referenced by 
20 Bridge-Sequence is partially erased. It is assumed that, as indicated by "Before Editing" in Fig.103A, Clipl and Clip2 
are interconnected by Bridge-Clip, with RSPN_exit_from_previous_Clip = X and RSPN_enter_to_current_Clip = Y. 
[0358] It is assumed that a stream data portion of Z1 source packets of Clipl , shown shaded, and a stream data 
portion of Z1 source packets of Clip2, likewise shown shaded, are to be erased. As a result, 
RSPN_exitJrom_previous_Clip becomes equal to X- Z1 and RSPN_enterJo_current_Clip becomes equal to Y-Z2, 
25 by way of value changing, as indicated by "After Editing" in Fig. 1 03B. 

[0359] By changing the syntax of the database pertinent to BridgeSequence as shown in Figs. 98 and 1 01 , the infor- 
mation pertinent to the source packet number indicating a data address in the AV stream (a field commencing from 
RSPN in the database syntax) is depleted from the layer of the PlayList, such that the totality of the information on the 
source packet number is stated in the Clip layer. 
30 [0360] In this case, if it is necessary to change the value of the data address in the AV stream, as when data of the 
AV stream file has been partially erased, it is only sufficient if measly the Clip Information file is supervised as data, 
thus advantageously facilitating database management. 

[0361] Fig. 104 is a flowchart illustrating the preparation of the Real PlayList. Reference is had to the block diagram 
of Fig.1 showing the recording and/or reproducing apparatus 1 . At step S1 0, a controller 23 records a Clip AV stream. 
35 At step S1 1 , the controller 23 forms PlayList() made up of Playltems covering the totality of the possible playback range 
of the above Clip. If there is STC discontinuous point in the Clip, and Playl_ist() is made up of two or more Playltems. 
the connection_condition between the Playltems is also determined. 

[0362] At step S12, the controller 23 forms UIApplnfoPlayList(). At step S13, the controller 23 forms PlayListMark. 
At step 14, the controller 23 forms Makers PrivateData. At step S15, the controller 23 records a Real PlayList file. In 

40 this manner, one Real PlayList file is created each time a Clip AV stream file is newly recorded. 

[0363] Fig. 105 is a flowchart for illustrating the formulation of a Virtual PlayList having a bridge sequence. At step 
S20, reproduction of one Real PlayList recorded on the disc is specified through a user interface. From the reproduction 
range of the Real PlayList, the reproduction domain specified by IN and OUT points is specified through user interface. 
[0364] At step S21 , the controller 23 verifies whether or not the operation of specifying the reproduction range by 

45 the user has been finished in its entirety. If it is verified that the specifying operation has been finished, the controller 
23 proceeds to step S22 and, if otherwise, the controller 23 reverts to step S20 to repeat the subsequent processing. 
[0365] At step S22, the user determines the connection condition between two continuously reproduced Playltems 
(connection_condition), through user interface, or the controller 23 determines the connection condition. At step S23, 
the controller 23 forms a bridge sequence for the seamlessly connected Playltems. At step S54, tghe controller 23 

so forms and records the Virtual PlayList file. 

[0366] Fig. 106 is a flowchart for illustrating the detailed processing at step S23. At step S31 , the controller 23 re- 
encodes and re-multiplexes an AV stream on the OUT point side of the Playltem displayed on the temporally forward 
side. At step S32, the controller 23 re-encodes and re-multiplexes the AV stream on the IN point side of the Playltem 
represented next to the Playltem. 

55 [0367] At step S33, the controller 23 determines the value of the RSPN_exit Jrom _previous_Clip such as to satisfy 
the data allocation condition for continuous data supply. That is, RSPN_exit_from_previous_Clip must be selected so 
that the stream portion of the Clip AV stream file previous to RSPN_exit_from _previous_Clip will be recorded in a 
continuous area not less than the aforementioned half fragment on the recording medium (see Figs.91 and 94), 
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[0368] At step S34, the controller 23 determines the value of RSPN_enter_to_current_Clip such as to satisfy the 
data allocation condition for continuous data supply. That is, RSPN_enter_to_current_Clip must be selected so that 
the stream portion of the AV stream file subsequent to RSPN_enter_to_current_Clip will be arrayed in a continuous 
area not less than the aforementioned one fragment on the recording medium (see Figs.91 and 94). 
5 [0369] At step S35, the controller 23 forms the Bridge-Clip AV stream file such as to satisfy the data allocation con- 
dition for continuous data supply. That is, if the volume of data prepared in the processing of steps S31 and S32 is less 
than the size of the aforementioned one half fragment, data is copied from the original Clip to prepare Bridge-Clip (see 
Figs.91 and 94). 

[0370] Although the processing operations of steps S33 to S35 is explained chronologically, these processing oper- 
10 ations may also be carried out non-sequentially or simultaneously since these processing operations are relevant 
reciprocally. 

[0371] At step S36, the controller 23 forms a bridge sequence database. At step S37, the controller 23 records a 
Bridge-Clip AV stream file and its Clip information file. In this manner, one or more Playltem is selected by the user 
from the reproduction range of Real PlayList recorded on the disc. A bridge sequence for enabling seamless connection 
15 of two Playltems is formed and a set of one or more Playltems, grouped together, is recorded as one Virtual PlayList. 
[0372] Fig. 1 07 is a flowchart for illustrating the reproduction of PlayList. At step S41 , the controller 23 acquires Info, 
dvr, a Clip Information file, a PlayList file and the thumbnail information to prepare a GUI picture a list of PlayLists 
recorded on the disc to display the so formed picture on the GUI through user interface. 

[0373] At step S42, the controller 23 presents the information illustrating the PlayList on the GUI picture based on 
20 UlApplnfoPiayListQ of each PlayList. At step S43, the user instructs reproduction of one PlayList from the GUI picture 

through a user interface. At step S44, the controller 23 acquires, from the PTS of INJime and STC-sequecne-id of the 

current Playltem, a source packet having an entry point temporally previous and closest to IN_time. 

[0374] At step S45, the controller 23 reads out data of the AV stream from the source packet number having the 

above entry point to send the so read out data to decoder. If, at step S46, there is any Playltem temporally previous 
25 to the current Playltem, the controller 23 performs the processing of interconnecting the previous Playltem and the 

current Playltem in accordance with connection_condition. If the Playltems are interconnected seamlessly, the AV 

stream is decoded based on the DVR-STD decoding method. 

[0375] At step S47, the controller 23 commands the AV decoder 27 to start the display as from the picture of the 
PTS of IN_time. At step S48, the controller 23 commands the AV decoder 27 to continue decoding the AV stream. At 

30 step S49, the controller 23 verifies whether or not the picture currently displayed is the picture of PTS of OUTJime. If 
it is verified that the picture currently displayed is not the picture of PTS of OUTJime, the controller 23 proceeds to 
step S50 to display the picture, after which the controller 23 reverts to step S48 to repeat the subsequent processing. 
[0376] If it is verified at step S49 that the picture currently displayed is the picture of PTS of OUTJime, the controller 
23 proceeds to step S51 where the controller 23 verifies whether or not the current Playltem is the last one in the 

35 PlayList. If it is verified that the current Playltem is not the last one, the controller 23 reverts to step S44 to repeat the 
subsequent processing. If it is verified that the current Playltem is the last one, reproduction of the PlayList is finished. 
[0377] In this manner, the one PlayList file, specified to be reproduced by the user, is reproduced. 
[0378] Based on the above-described syntax, data structure and rule, as described above, the contents of data 
recorded on the recording medium, the playback information and so forth can be properly managed to enable the user 

40 to confirm the contents of data recorded on the recording medium properly to reproduce the desired data extremely 
readily. 

[0379] The above-described sequence of operations may be executed not only by hardware but also by software. If 
the sequence of operations is to be executed by software, it is installed from a recording medium to a computer in the 
dedicated hardware of which is installed the program forming the software or to e.g., a general-purpose personal 

45 computer capable of executing various functions based on a variety of programs installed therein. 

[0380] Fig. 108 shows an illustrative inner structure of a general-purpose computer. A CPU (central processing unit) 
201 of the personal computer executes a variety of processing operations in accordance with a program stored in a 
ROM (read-only memory) 202. In a RAM (random-access memory) 203, there are suitably stored data or programs 
necessary for the CPU 201 to execute various processing operations. To an input/output interface 205 is connected 

so an input unit 206 formed by a keyboard, a mouse and so forth to output signals input to the input unit 206 to the CPU 
201 . To the input/output interface 205, there is also connected an output unit 207 made up of a display, a loudspeaker 
and so forth. 

[0381] To the input/output interface 205, there are also connected a recording unit 208 made up e.g., of a hard disc 
and a communication unit 209 adapted for exchanging data with outer equipment over a network, such as Internet. A 
55 drive 21 0 is used for reading out or writing data on or from a recording medium, such as a magnetic disc 221 , an optical 
disc 222, a magneto-optical disc 223 or a semiconductor memory 224. 

[0382] The recording medium is constituted not only by a package medium distributed for furnishing the program to 
the user, in addition to a computer, such as a magnetic disc 221 carrying the program thereon, inclusive of a floppy 
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disc, an optical disc 222, inclusive of a CD-ROM (Compact Disc-Read-Only memory) or a DVD (Digital Versatile Disc), 
a magneto-optical disc 223, inclusive of a Mini-Disc, or a semiconductor memory 224, but also by a hard disc, inclusive 
of a ROM 202 carrying a program and a memory 208, furnished to the user as it is built-in in a computer, as shown in 
Fig.108. 

[0383] In the present specification, the steps of the program furnished by the medium include not only the chrono- 
logical processing in accordance with the sequence indicated, but also the processing performed not chronologically 
but in parallel or separately. 

[0384] Additionally, in the specification, the system means an entire apparatus comprised of plural component de- 
vices. 

Industrial Applicability 



[0385] In the information processing method and apparatus, and the program, according to the present invention, if 
the it is commanded to perform reproduction continuously from the first AV stream to the second AV stream, there is 

*5 generated a third AV stream made up of a preset portion of the first AV stream and a preset portion of the second AV 
stream and which is reproduced when reproduction is switched from the first AV stream to the second AV stream, while 
there is generated, as the information pertinent to the third AV stream, the address information made up of the infor- 
mation on an address of a source packet of the first AV stream at a timing of switching of reproduction from the first 
AV stream to the third AV stream, and of the information on the address of a source packet of the second AV stream 

20 at a timing of switching of reproduction from the third AV stream to the second AV stream. So, reproduction may be 
achieved such as to maintain continuity in separately recorded AV streams. 

[0386] In the information processing method and apparatus, and the program, according to the present invention, a 
first AV stream, a second AV stream or a third AV stream is read out from a recording medium, the address information 
made up of the information on an address of a source packet of the first AV stream at a timing of switching of reproduction 

25 from the first AV stream to the third AV stream, and the information on the address of a source packet of the second 
AV stream at a timing of switching of reproduction from the third AV stream to the second AV stream, is read out from 
the recording medium as the information pertinent to the third AV stream, and reproduction is switched from the first 
AV stream to the third AV stream and from the third AV stream to the second AV stream, as reproduction proceeds, 
based on the read-out information pertinent to the third AV stream. So, reproduction may be achieved such as to 

30 maintain continuity in separately recorded AV streams. 

Claims 

35 1. An information processing apparatus comprising: 

generating means for generating, in case continuous reproduction from a first AV stream to a second AV stream 
is commanded, a third AV stream made up of a preset portion of said first AV stream and a preset portion of 
said second AV stream, said third AV stream being reproduced when reproduction is switched from said first 

*o AV stream to said second AV stream, and the address information, as the information pertinent to said third 

AV stream, said address information being made up of the information on an address of a source packet of 
said first AV stream at a timing of switching of reproduction from said first AV stream to said third AV stream, 
and of the information on the address of a source packet of said second AV stream at a timing of switching of 
reproduction from said third AV stream to said second AV stream; and 

45 recording means for recording said third AV stream and said address information, as generated by said gen- 

erating means. 

2. The information processing apparatus according to claim 1 wherein an arrival time stamp of the source packet of 
said first AV stream included in said address information generated by said generating means, and an arrival time 
so stamp of a source packet located at a leading end of said third AV stream, are continuous to each other, and 

wherein an arrival time stamp of the source packet of said second AV stream included in said address information 
generated by said generating means and an arrival time stamp of a source packet located at a trailing end of said 
third AV stream are continuous to each other. 

55 3. The information processing apparatus according to claim 2 wherein a sole discontinuous point exists in the arrival 
time stamp of the source packet in said third AV stream. 

4. The information processing apparatus according to claim 2 wherein said address is determined so that a data 
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portion of the AV stream previous to a source packet specified by the information on the address of the source 
packet of said first AV stream contained in said address information generated by said generating means will be 
located in a continuous area of not less than a preset size on a recording medium. 

5 5. The information processing apparatus according to claim 2 wherein said address is determined so that a data 
portion of the AV stream subsequent to a source packet specified by the information on the address of the source 
packet of said second AV stream contained in said address information generated by said generating means will 
be located in a continuous area of not less than a preset size on a recording medium. 

10 6. The information processing apparatus according to claim 2 wherein said third AV stream is generated so that said 
third AV stream will be located in a continuous area of not less than a preset size on said recording medium. 

7. An information generating method comprising: 

15 a step of generating, in case continuous reproduction from a first AV stream to a second AV stream is com- 

manded, a third AV stream made up of a preset portion of said first AV stream and a preset portion of said 
second AV stream, said third AV stream being reproduced when reproduction is switched from said first AV 
stream to said second AV stream, and 

a step of generating the address information, as the information pertinent to said third AV stream, said address 
20 information being made up of the information on an address of a source packet of said first AV stream at a 

timing of switching of reproduction from said first AV stream to said third AV stream, and of the information on 
the address of a source packet of said second AV stream at a timing of switching of reproduction from said 
third AV stream to said second AV stream. 

25 8. A recording medium having recorded thereon a computer-readable program, said program comprising: 

a step of generating, in case continuous reproduction from a first AV stream to a second AV stream is com- 
manded, a third AV stream made up of a preset portion of said first AV stream and a preset portion of said 
second AV stream, said third AV stream being reproduced when reproduction is switched from said first AV 

30 stream to said second AV stream, and 

a step of generating the address information, as the information pertinent to said third AV stream, said address 
information being made up of the information on an address of a source packet of said first AV stream at a 
timing of switching of reproduction from said first AV stream to said third AV stream, and of the information on 
the address of a source packet of said second AV stream at a timing of switching of reproduction from said 

35 third AV stream to said second AV stream. 

9. A program for having a computer execute a program, said program comprising: 

a step of generating, in case continuous reproduction from a first AV stream to a second AV stream is com- 
40 manded, a third AV stream made up of a preset portion of said first AV stream and a preset portion of said 

second AV stream, said third AV stream being reproduced when reproduction is switched from said first AV 
stream to said second AV stream, and 

a step of generating the address information, as the information pertinent to said third AV stream, said address 
information being made up of the information on an address of a source packet of said first AV stream at a 
4 5 timing of switching of reproduction from said first AV stream to said third AV stream, and of the information on 

the address of a source packet of said second AV stream at a timing of switching of reproduction from said 
third AV stream to said second AV stream. 

10. An information processing apparatus comprising: 

50 

first readout means for reading out a first AV stream, a second AV stream or a third AV stream from a recording 
medium; 

second readout means for reading out, as the information pertinent to said third AV stream, the information 
on an address of a source packet of said first AV stream at a timing of switching of reproduction from said first 
55 AV stream to said third AV stream, and of the information on the address of a source packet of said second 

AV stream at a timing of switching of reproduction from said third AV stream to said second AV stream; and 
reproducing means for performing reproduction as reproduction is switched from said first AV stream read out 
by said first readout means to said third AV stream and from said third AV stream to said second AV stream, 
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based on the information pertinent to said third AV stream read out by said second readout means. 

11. An information processing method comprising: 

5 a first readout controlling step of reading out a first AV stream, a second AV stream or a third AV stream from 

a recording medium; 

a second readout controlling step of reading out, as the information pertinent to said third AV stream, the 
information on an address of a source packet of said first AV stream at a timing of switching of reproduction 
from said first AV stream to said third AV stream, and of the information on the address of a source packet of 
10 said second AV stream at a timing of switching of reproduction from said third AV stream to said second AV 

stream; and 

a reproducing step of performing reproduction as reproduction is switched from said first AV stream read out 
by said first readout means to said third AV stream and from said third AV stream to said second AV stream, 
based on the information pertinent to said third AV stream read out by said second readout means. 

15 

12. A recording medium having recorded thereon a computer-readable program, said program comprising: 

a first readout controlling step of reading out a first AV stream, a second AV stream or a third AV stream from 
a recording medium; 

20 a second readout controlling step of reading out, as the information pertinent to said third AV stream, the 

information on an address of a source packet of said first AV stream at a timing of switching of reproduction 
from said first AV stream to said third AV stream, and of the information on the address of a source packet of 
said second AV stream at a timing of switching of reproduction from said third AV stream to said second AV 
stream; and 

25 a reproducing step of performing reproduction as reproduction is switched from said first AV stream read out 

by said first readout means to said third AV stream and from said third AV stream to said second AV stream, 
based on the information pertinent to said third AV stream read out by said second readout means. 

13. A program for having a computer execute a program, said program comprising: 

30 

a first readout controlling step of reading out a first AV stream, a second AV stream or a third AV stream from 
a recording medium; 

a second readout controlling step of reading out, as the information pertinent to said third AV stream, the 
information on an address of a source packet of said first AV stream at a timing of switching of reproduction 
35 from said first AV stream to said third AV stream, and of the information on the address of a source packet of 

said second AV stream at a timing of switching of reproduction from said third AV stream to said second AV 
stream; and 

a reproducing step of performing reproduction as reproduction is switched from said first AV stream read out 
by said first readout means to said third AV stream and from said third AV stream to said second AV stream, 
40 based on the information pertinent to said third AV stream read out by said second readout means. 

14. A recording medium having recorded thereon the address information comprising, in case continuous reproduction 
from a first AV stream to a second AV stream is commanded, a third AV stream made up of a preset portion of 
said first AV stream and a preset portion of said second AV stream, said third AV stream being reproduced when 

45 reproduction is switched from said first AV stream to said second AV stream, and the address information, as the 

information pertinent to said third AV stream, said address information being made up of the information on an 
address of a source packet of said first AV stream at a timing of switching of reproduction from said first AV stream 
to said third AV stream, and of the information on the address of a source packet of said second AV stream at a 
timing of switching of reproduction from said third AV stream to said second AV stream. 

50 
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REFERENCED IN PlayListMark 



USER SPECIFIES MARK POINT OF REPLAY 
START POINT 
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IF CPLtype IS EP_mapTYPE, ACQUIRE MARK 
PTS AND Playltem_id IT BELONGS, IF CPLtype IS 
TU.map TYPE, ACQUIRE MARK ATS 
(Arrival Time Stamp) 
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IF CPLtype IS EP.mapTYPE, ACQUIRE 
STC-sequence-id OF AV STERAM REFERENCED 
BY Playltem SPECIFIED BY Playltemjd 
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IF CPLtype IS EP_mapTYPE, INPUT TO 
DECODER AV STREAM BASED ON ABOVE 
STC-sequence-id AND MARK PTS. IF CPLtype 
ISTU.mapTYPE, INPUTTO DECODER AV 
STREAM BASED ON MARK ATS 



IF CPLtype IS EP_mapTYPE, START DISPLAY 
FROM PICTURE OF MARK POINT PTS. IF 
CPLtype ISTU_mapTYPE, START DISPLAY FROM 
PICTURE SUBSEQUENT TO MARK POINT ATS. 
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